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HETEROLOGOUS G-CSF FUSION PROTEIN 



(57) Abstract: The present invention encompasses heterologous fusionS proteins comprising a hyperglycsoylated G-CSF analog 
Q fusedto proteins such as albumin and the Fc portion of animmunoglobulin which act to extend the in vivo half-life ofthe protein 
^ compared to native G-CSF. These fusion proteinsare particularly suited for the treatment of conditionslO treatable by stimulation of 
^ circulating neutrophils, such as after chemotherapy regimens or in chronic congenital neutropenia. 
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HETEROLOGOUS G-CSF FUSION PROTEINS 

The present invention relates to heterologous fusion 
proteins, including analogs and derivatives thereof, fused 
to proteins that have the effect of extending the in vivo 
half -life of the proteins. These, fusion proteins are 
significant in human medicine, particularly in the treatment 
of conditions treatable by stimulation of circulating 
neutrophils, such as after chemotherapy regimens or in 
chronic congenital neutropenia. More specifically, the 
invention relates to novel heterologous fusion proteins with 
granulocyte-colony stimulating factor activity. 

Among all blood cell lineages, the modulation of 
neutrophil and platelet production has been of highest 
interest to clinical oncologists and hematologists . 
Myelosuppression is the single most severe complication of 
cancer chemotherapy, and a major cause of treatment delay 
during multiple -cycle or combination chemotherapy. It is 
also the major dose-limiting factor for most 
chemotherapeutic agents. Due to the short half -lives of 
neutrophils in peripheral blood, life-threatening falls in 
neutrophil levels are seen, after a number of conventional 
anti-tumor chemotherapy regimens. 

The most prominent regulator of granulopoiesis is 
granulocyte-colony stimulating factor (G-CSF) . G-CSF 
induces proliferation and differentiation of hematopoietic 
progenitor cells resulting in increased numbers of 
circulating neutrophils. G-CSF also stimulates the release 
of mature neutrophils from bone marrow and activates their 
functional state. [Souza L.M., et al . (1986) Science 
232:61-65]. Thus, therapeutic proteins with G-CSF activity 
have tremendous value in situations where there are reduced 
circulating levels of neutrophilic granuloctyes . 

However, the usefulness of therapy using G-CSF peptides 
has been limited by their short plasma half -life. Thus, 
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they must be administered intravenously or subcutaneously at 
fairly frequent intervals (once or twice a day) in order to 
maintain their neutrophil stimulating properties. In 
addition, this short half-life limits the performance of the 
drug to traditional drug delivery systems. It would clearly 
benefit the treatment of patients with abnormally low 
neutrophils, and reduce the discomfort and inconvenience 
associated with frequent injections to provide a 
pharmaceutical agent that could be administered less 
frequently and optionally by alternative routes of 
administration. Thus, a need exists to develop agents that 
stimulate the production of mature neutrophils and are more 
optimal in their duration of effect. 

The present invention overcomes the problems associated 
with delivering a compound that has a short plasma half-life 
in two respects. First, G-CSF is hyperglycosylated. The 
carbohydrate content of G-CSF is altered by substituting 
amino acids that can act as substrates for glycosylating 
enzymes in mammalian cells. Most significantly, the present 
invention encompasses fusion of these hyperglycosylated G- 
CSF analogs to another protein with a long circulating half- 

+ 

life such as the Fc portion of an immunoglobulin or albumin. 

Compounds of the present invention include heterologous 
fusion proteins comprising a hyperglycosylated G-CSF analog 
fused to a polypeptide selected from the group consisting of 

a) human albumin; 

b) human albumin analogs; and 

c) fragments of human albumin. 

Compounds of the present invention also include heterologous 
fusion proteins comprising a hyperglycosylated G-CSF analog 
fused to a polypeptide selected from the group consisting of 

a) human albumin ; 

b) human albumin analogs; and 

c) fragments of human albumin, 



i 
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wherein the hyperglycosylated G-CSF analog is fused to the 
polypeptide via a peptide linker. 

Additional compounds of the present invention include a 
heterologous fusion protein comprising a hyperglycosylated 
G-CSF analog fused to a polypeptide selected from the group 
consisting of 

a) the Fc portion of an immunoglobulin; 

b) an analog of the Fc portion of an immunoglobulin; 
and 

c) fragments of the Fc portion of an immunoglobulin. 
The G-CSF analog may be fused to the polypeptide via a 
peptide linker. It is preferable that the peptide linker is 
selected from the group consisting of: 

a) a glycine rich peptide; 

b) a peptide having the sequence [Gly-Gly-Gly-Gly- 
Ser] n where n is 1, 2, 3, 4, or 5; and 

c) a peptide having the sequence [Gly-Gly-Gly-Gly-Ser] 3 
The present invention further provides data showing 

that these G-CSF analogs are glycosylated in mammalian cells 
and retain their activity. 

One aspect of the present invention includes 
heterologous fusion proteins, wherein the hyperglycosylated 
G-CSF analogs have the Formula (I) [SEQ ID NO:l] 
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Pro 
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Ala 
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Gin 
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Lys 


He 


Gin 


Gly Asp 


Gly 


Ala 


Ala 


Leu 


Gin 
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40 










45 
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Lys 


Leu 
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Xaa 


Xaa 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 
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55 










60 
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Leu 
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Leu 
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He 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
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Xaa 


Xaa 


Xaa 


Xaa 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 
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Gly 


Leu 


Phe 


Leu 
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Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Xaa 


Xaa 


Xaa 
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Leu 


Gly 
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Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 
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Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


Met 


Ala 


Pro 
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Ala 


Phe 
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Xaa 


Phe 
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165 170 
Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro (I) 

wherein: 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 133 is Thr or Asn; 

Xaa at position 134 is Gin or any other amino acid except 
Pro; 

Xaa at position 135 is Gly, Ser, or Thr 
Xaa at position 141 is Ala or Asn; 

Xaa at position 142 is Ser or any other amino acid except 
Pro; and 

Xaa at position 143 is Ala, Ser, or Thr; 



17 


is 


Cvs . 


Ala. Leu, Ser, or Glu; 
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or Asn ■ 
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is 


Thr, 


or any other amino acid 


except 
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is 


Ala 


or any other amino acid 


except 


fa U 


* 

1 s 


T5 "v* ^\ 


1111/ HSIl; UX Sex / 




D 1 


• 

IS 


jjeu , 


or &ny ouncx aiimiu o^> j.u 


#u "V" f"> PT>1~ 
CA. V* C W L. 


62 


is 


Ser 


or Thr; 




o 6 


* 

1 S 


ber 


or Asn; 




64 


* 

IS 


Cys 


or any other ammo acia 


except. 


65 


is 


Pro, 


Ser, or Thr; 




66 


is 


Ser 


or Thr; 




67 


is 


Gin 


or Asn; 




68 


is 


Ala 


or any other amino acid 


except 


69 


is 


Leu, 


Thr, or Ser 




93 


is 


Glu 


or Asn 




94 


is 


Gly 


or any other amino acid 


except 


95 


is 


lie, 


Asn, Ser, or Thr; 




97 


is 


Pro, 


Ser, Thr, or Asn; 





and wherein: 

Xaa at positions 37, 38, 

Xaa at positions 58, 59, 

Xaa at positions 59, 60, 

Xaa at positions 60, 61, 

Xaa at positions 61, 62, 



and 3 9 constitute region 1 

and 60 constitute region 2 

and 61 constitute region 3 

and 62 constitute region 4 

and 63 constitute region 5 
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Xaa 



Xaa 



Xaa 



Xaa 



Xaa 



at positions 62, 

at ' positions 63 , 

at positions 64 , 

at positions 67, 

at positions 93, 



63, and 

64, and 

65, and 
6 8 , and 
94, and 



64 constitute region 6; 

65 constitute region 7; 

66 constitute region 8 ; 
69 constitute region 9; 



95 constitute region 10; 



Xaa at positions 94, 95, and Ser at position 96 

constitute region 11; 
Xaa at positions 95, and 97, and Ser at position 96 

constitute region 12; 
Xaa at positions 133, 134, and 135 constitute 

region 13; 

Xaa at positions 141, 142, and 143 constitute 
region 14 ; 

and provided that at least one of regions 1 through 14 
comprises the sequence Asn Xaal Xaa2 wherein Xaal is any 
amino acid except Pro and Xaa2 is Ser or Thr. 

Thus, the heterologous fusion proteins of the present 
invention include analogs wherein one or any combination of 
two or more regions comprise the sequence Asn Xaal Xaa2 
wherein Xaal is any amino acid except Pro and Xaa2 is Ser or 
Thr. 

Preferred hyperglycosylated G-CSF analogs that make up part 
of the heterologous fusion proteins of the present 
invention, include the following: 



a) 


G- 


■CSF[A37N, Y39T] 


b) 


G- 


■CSF[P57V,W58N, P60T] 


c) 


G- 


■CSF[P60N,S62T] 


d) 


G- 


■CSF[S63N, P65T] 


e) 


G- 


■CSF[Q67N,L69T] 


f ) 


G- 


■CSF[E93N, I95T] 


g) 


G- 


■CSF[T133N,G135T] 


h) 


G- 


■CSF[A141N, A143T] 


l) 


G- 


CSF [A37N, Y39T, P57V, W58N, P60T] 


d) 


G- 


■CSF[A37N, Y39T, P60N,S62T] 


k) 


G- 


■CSF [A37N, Y39T, S63N', P65T] 


1) 


G- 


CSF[A37N, Y39T i Q67N,L69T] 


m) 


G- 


CSF[A37N, Y39T,E93N, I95T] 


n) 


G- 


CSF [ A3 7N , Y3 9T , T13 3N , Gl 3 5T] 


o) 


G- 


CSF [ A3 7N , Y3 9T , A14 IN , Al 4 3T] 
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p) G-CSF [A37N, Y39T, P57V, W58N, P60T, S63N, P65T] 
q) G-CSF [A37N, Y39T, P57V, W58N, P60T, Q67N, L69T] 
r) G-CSF [A37N,Y39T,S63N,P65T,E93N, I95T] 

The present invention also includes heterologous fusion 
proteins, which are the product of the expression in a host 
cell of an exogenous DNA sequence, which comprises a DNA 
sequence encoding a heterologous fusion protein of Formula I 
(described above) fused to a DNA sequence encoding human 
albumin or the Fc portion of an immunolglobulin . 

The present invention includes an isolated nucleic acid 
sequence, comprising a polynucleotide encoding a 
heterologous fusion protein described above. Exemplary 
isolated nucleic acids of the present invention include 
isolated nucleic acid sequence comprising a 
hyperglycosylated G-CSF analog selected from the group 

consisting of: 

a) SEQ ID NO: 2 



ACC 
TGG 


CCC 
GGG 


CTG 
GAC 


GGC 
CCG 


CCT 
GGA 


GCC 
CGG 


AGC 
TCG 


TCC 
AGG 


CTG 
GAC 


CCC 
GGG 


CAG 
GTC 


AGC 
TCG 


TTC 
AAG 


CTG 
GAC 


CTC 
GAG 


AAG 
TTC 


GCC 
CGG 


TTA 
AAT 


GAG 
CTC 


CAA 
GTT 


GTG 
CAC 


AGG 
TCC 


AAG 
TTC 


ATC 
TAG 


CAG 
GTC 


GGC 
CCG 


GAT 
CTA 


GGC 
CCG 


GCA 
CGT 


GCG 
CGC 

* 


CTC 
GAG 


CAG 
GTC 


GAG 
CTC 


AAG 
TTC 


CTG 

GAC 


TGT 
ACA 


GCC 
CGG 


ACC 
TGG 


TAC 
ATG 


AAG 
TTC 


CTG 
GAC 


TGC 
ACG 


CAC 
GTG 


CCC 
GGG 


GAG 
CTC 


GAG 
CTC 


CTG 
GAC 


GTG 
CAC 


CTG 
GAC 


CTC 
GAG 


GGA 
CCT 


CAC 
GTG 


TCT 
ACA 


CTG 
GAC 


GGC 
CCG 


ATC 
TAG 


CCC 
GGG 


TGG 
ACC 


GCT 
CGA 


CCC 
GGG 


CTG 
GAC 


AGC 
TCG 


AGC 
TCG 


TGC 
ACG 


CCC 
GGG 


AGC 
TCG 


CAG 
GTC 


GCC 
CGG 


CTG 
GAC 


CAG 
GTC 


CTG 
GAC 


GCA 
CGT 


GGC 
CCG 


TGC 
ACG 


TTG 
AAC 


AGC 
TCG 


CAA 
GTT 


CTC 
GAG 


CAT 
GTA 


AGC 
TCG 


GGC 
CCG 


CTT 
GAA 


TTC 
AAG 


CTC 
GAG 


TAC 
ATG 


CAG 
GTC 


GGG 
CCC 


CTC 
GAG 


CTG 
GAC 


CAG 
GTC 


GCC 
CGG 


CTG 
GAC 


GAA 
CTT 


GGG 
CCC 


ATC 
TAG 


TCC 
AGG 


CCC 
GGG 


GAG 
CTC 


TTG 
AAC 


GGT 
CCA 


CCC 
GGG 


ACC 
TGG 


TTG 
AAC 


GAC 
CTG 


ACA 
TGT 


CTG 
GAC 


CAG 
GTC 


CTG 
GAC 


GAC 
CTG 


GTC 
CAG 


GCC 
CGG 


GAC 
CTG 


TTT 
AAA 


GCC 
CGG 


ACC 
TGG 


ACC 
TGG 


ATC 
TAG 


TGG 
ACC 


CAG 
GTC 


CAG 
GTC 


ATG 
TAC 


GAA 
CTT 


GAA 
CTT 


CTG 
GAC 


GGA 
CCT 


ATG 
TAC 


GCC 
CGG 


CCT 
GGA 


GCC 
CGG 


CTG 
GAC 


CAG 
GTC 


CCC 
GGG 


AAC 
TTG 


CAG 
GTC 


ACC 
TGG 


GCC 
CGG 


ATG 
TAC 


CCG 
GGC 


GCC 
CGG 


TTC 
AAG 


GCC 
CGG 


TCT 
AGA 


GCT 
CGA 


TTC 
AAG 


CAG 
GTC 


CGC 
GCG 


CGG 
GCC 


GCA 
CGT 


GGA 
CCT 


GGG 
CCC 


GTC 
CAG 


CTG 
GAC 


GTT 
CAA 


GCC 
CGG 


TCC 
AGG 


CAT 
GTA 


CTG 
GAC 


CAG 
GTC 


AGC 
TCG 


TTC 
AAG 
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CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

b) SEQ ID NO: 3 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA CGG TGG ATG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC AAC TCT ACC TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG TTG AGA TGG AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

C) SEQ ID NO: 4 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA TTG TGG TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG QCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 
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TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAJ GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

d) SEQ ID NO: 5 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA CGG TGG ATG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT AAC ACT AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA TTG GAC TCC TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA. ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

e) SEQ ID NO:6 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

♦ 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA CGG TGG ATG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AAT TGC 
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GAC GAG CCT GTG ACA GAC 

ACC AGC CAG GCC CTG CAG 
TGG TCG GTC CGG GAC GTC 

GGC CTT TTC CTC TAC CAG 
CCG GAA AAG GAG ATG GTC 

CCC GAG TTG GGT CCC ACC 
GGG CTC AAC CCA GGG TGG 

TTT GCC ACC ACC ATC TGG 
AAA CGG TGG TGG TAG ACC 

GCC CTG CAG CCC ACC CAG 
CGG GAC GTC GGG TGG GTC 

CAG CGC CGG GCA GGA GGG 
GTC GCG GCC CGT CCT CCC 

CTG GAG GTG TCG TAC CGC 
GAC CTC CAC AGC ATG GCG 

f) SEQ ID NO: 7 

ACC CCC CTG GGC CCT GCC 

TGG GGG GAC CCG GGA CGG 



GCC TTA GAG CAA GTG AGG 
CGG AAT CTC GTT CAC TCC 

GAG AAG CTG TGT GCC ACC 
CTC TTC GAC ACA CGG TGG 

CTG CTC GGA CAC TCT CTG 
GAC GAG CCT GTG ACA GAC 

CCC AGC CAG GCC CTG CAG 
GGG TCG GTC CGG GAC GTC 

GGC CTT TTC CTC TAC CAG 
CCG GAA AAG GAG ATG GTC 

CCC GAG TTG GGT CCC ACC 
GGG CTC AAC CCA GGG TGG 

TTT GCC ACC ACC ATC TGG 
AAA CGG TGG TGG TAG ACC 

GCC CTG CAG CCC ACC CAG 
CGG GAC GTC GGG TGG GTC 

CAG CGC CGG GCA GGA GGG 
GTC GCG GCC CGT CCT CCC 

CTG GAG GTG TCG TAC CGC 
GAC CTC CAC AGC ATG GCG 



-9- 

CCG TAG GGG ACC CGA 

CTG GCA GGC TGC TTG 
GAC CGT CCG ACG AAC 

GGG CTC CTG CAG GCC 
CCC GAG GAC GTC CGG 

TTG GAC ACA CTG CAG 
AAC CTG TGT GAC GTC 

CAG CAG ATG GAA GAA 
GTC GTC TAC CTT CTT 

GGT GCC ATG CCG GCC 
CCA CGG TAC GGC CGG 

GTC CTG GTT GCC TCC 
CAG GAC CAA CGG AGG 

GTC TTA AGG CAC CTT 
CAG AAT TCC GTG GAA 



AGC TCC CTG CCC CAG 
TCG AGG GAC GGG GTC 



AAG ATC CAG GGC GAT 
TTC TAG GTC CCG CTA 

TAC AAG CTG TGC CAC 
ATG TTC GAC ACG GTG 

GGC ATC GTT AAC GCT 
CCG TAG CAA TTG CGA 

CTG GCA GGC TGC TTG 
GAC CGT CCG ACG AAC 

GGG CTC CTG CAG GCC 
CCC GAG GAC GTC CGG 

TTG GAC ACA CTG CAG 
AAC CTG TGT GAC GTC 

CAG CAG ATG GAA GAA 
GTC GTC TAC CTT CTT 

GGT GCC ATG CCG GCC 
CCA CGG TAC GGC CGG 

GTC CTG GTT GCC TCC 
CAG GAC CAA CGG AGG 

GTC TTA AGG CAC CTT 
CAG AAT TCC GTG GAA 



PCT/US03/03120 



GGG GAC TCG TTA ACG 

AGC CAA CTC CAT AGC 
TCG GTT GAG GTA TCG 

CTG GAA GGG ATC TCC 
GAC CTT CCC TAG AGG 

CTG GAC GTC GCC GAC 
GAC CTG CAG CGG CTG 

CTG GGA ATG GCC CCT 
GAC CCT TAC CGG GGA 

TTC GCC TCT GCT TTC 
AAG CGG AGA CGA AAG 

CAT CTG CAG AGC TTC 
GTA GAC GTC TCG AAG 

GCC CAG CCC 
CGG GTC GGG 



AGC TTC CTG CTC AAG 
TCG AAG GAC GAG TTC 



GGC GCA GCG CTC CAG 
CCG CGT CGC GAG GTC 

CCC GAG GAG CTG GTG 
GGG CTC CTC GAC CAC 

ACC CTG AGC AGC TGC 
TGG GAC TCG TCG ACG 

AGC CAA CTC CAT AGC 
TCG GTT GAG GTA TCG 

CTG GAA GGG ATC TCC 
GAC CTT CCC TAG AGG 

CTG GAC GTC GCC GAC 
GAC CTG CAG CGG CTG 

CTG GGA ATG GCC CCT 
GAC CCT TAC CGG GGA 

TTC GCC TCT GCT TTC 
AAG CGG AGA CGA AAG 

CAT CTG CAG AGC TTC 
GTA GAC GTC TCG AAG 

GCC CAG CCC 
CGG GTC GGG 



* 
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g) SEQ ID NC 
ACC CCC CTG 
TGG GGG GAC 


>:8 

GGC 

CCG 


CCT 
GGA 


GCC 
CGG 


AGC 
TCG 


TCC 
AGG 


CTG 
GAC 


CCC 
GGG 


CAG 
GTC 


AGC 
TCG 


TTC 
AAG 


CTG 
GAC 


CTC 
GAG 


AAG 
TTC 


GCC 
CGG 


TTA 
AAT 


GAG 
CTC 


CAA 

GTT 


GTG 
CAC 


AGG 
TCC 


AAG 
TTC 


ATC 
TAG 


CAG 
GTC 


GGC 
CCG 


GAT 
CTA 


GGC 
CCG 


GCA 
CGT 


GCG 
CGC 


CTC 
GAG 


CAG 
GTC 


GAG 
CTC 


AAG 
TTC 


CTG 
GAC 


TGT 


GCC 


ACC 

1 wo 


TAC 


AAG 

TTC 


CTG 

RAP 


TGC 
ACG 


CAC 
GTG 


CCC 
GGG 


GAG 
CTC 


GAG 
CTC 


CTG 
GAC 


GTG 
CAC 


CTG 
GAC 


CTC 
GAG 


GGA 
CCT 


CAC 

GTG 


TCT 

ACA 


CTG 


GGC 


ATC 


CCC 


TGG 
app 


GCT 


CCC 

VJww 


CTG 

GAG 


AGC 
TCG 


AGC 
TCG 


TGC 
ACG 


CCC 
GGG 


AGC 
TCG 


AAC 
TTG 


GCC 
CGG 


ACC 

TGG 


CAG 


CTG 


GCA 

Lb Jl 


GGC 

ppp 

CCG 


TGC 

ALU 


TTG 

21 HC 


AGC 


CAA 

\3 X X 


CTC 


CAT 
GTA 


AGC 
TCG 


GGC 
CCG 


CTT 
GAA 


TTC 
AAG 


CTC 
GAG 


TAC 
ATG 


CAG 

GTC 


GGG 


CTC 

TV 

GAG 


CTG 

GAC 


CAG 

pmp 
G± C 


GCC 

Cub 


CTG 


GAA 

PTT 


GGG 


ATC 
TAG 


TCC 
AGG 


CCC 
GGG 


GAG 
CTC 


TTG 
AAC 


GGT 
CCA 


CCC 

/-» 

GGG 


ACC 
IGG 


TTG 

TV 7* O 

AAC 


GAC 

C1G 


ACA 

1 G 1 


CTG 

p7\p 

bAC 


CAG 

pmp 


CTG 

oj\p» 


GAC 

I** X w 


GTC 

GAG 


GCC 
CGG 

* 


GAC 
CTG 


TTT 
AAA 


GCC 
CGG 


ACC 
TGG 


ACC 

1 Gu 


ATC 

1 Ao 


TGG 


CAG 
cvrc 


CAG 


ATG 

TAP 


GAA 

PTT 

X X 


GAA 

CTT 


CTG 


GGA 
CCT 


ATG 
TAC 


GCC 
CGG 

^krf ^Mb* 


CCT 
GGA 


GCC 
CGG 


CTG 
GAC 


CAG 
GTC 


CCC 
GGG 


ACC 
TGG 


CAG 
GTC 


GGT 
CCA 


GCC 
CGG 


ATG 
TAC 


CCG 
GGC 


GCC 
CGG 


TTC 
AAG 


GCC 
CGG 


TCT 
AGA 


GCT 
CGA 


TTC 
AAG 


CAG 
GTC 


CGC 
GCG 


CGG 
GCC 


GCA 
CGT 


GGA 
CCT 


GGG 
CCC 


GTC 
CAG 


CTG 
GAC 


GTT 
CAA 


GCC 
CGG 


TCC 
AGG 


CAT 
GTA 


CTG 
GAC 


CAG 
GTC 


AGC 
TCG 


TTC 
AAG 


CTG 
GAC 


GAG 
CTC 


GTG 
CAC 


TCG 
AGC 


TAC 
ATG 


CGC 
GCG 


GTC 
CAG 


TTA 
AAT 


AGG 
TCC 


CAC 
GTG 


CTT 
GAA 


GCC 
CGG 


CAG 
GTC 


CCC 
GGG 






h) SEQ ID NO: 9 

ACC CCC CTG GGC 
TGG GGG GAC CCG 


CCT 

or 1 & 


GCC 

ctzn 


AGC 

X WU 


TCC 


CTG 

GZVC 


CCC 
ceo. 


CAG 

GTC 


AGC 

TGG 


TTC 
AAG 


CTG 
GAC 


CTC 
GAG 


AAG 
TTC 


GCC 
CGG 


TTA 
AAT 


GAG 
CTC 


CAA 
GTT 


GTG 

CTiC 


AGG 

x 


AAG 

XXV- 


ATC 
TAG 

X **w 


CAG 

w X \*» 


GGC 

CCG 


GAT 
CTA 


GGC 
CCG 


GCA 
CGT 


GCG 
CGC 

^■B' 


CTC 
GAG 


CAG 
GTC 


GAG 
CTC 


AAG 
TTC 


CTG 
GAC 


TGT 
ACA 


GCC 

CGG 


ACC 
TGG 


TAC 
ATG 


AAG 
TTC 


CTG 
GAC 


TGC 
ACG 


CAC 
GTG 


CCC 
GGG 


GAG 
CTC 


GAG 
CTC 


CTG 
GAC 


GTG 
CAC 


CTG 
GAC 


CTC 
GAG 


GGA 
CCT 


CAC 
GTG 


TCT 
ACA 


CTG 
GAC 


GGC 
CCG 


ATC 
TAG 


CCC 
GGG 


TGG 
ACC 


GCT 
CGA 


CCC 
GGG 


CTG 
GAC 


AGC 
TCG 


AGC 
TCG 


TGC 
ACG 


CCC 
GGG 


AGC 
TCG 


CAG 
GTC 


GCC 
CGG 


CTG 
GAC 


CAG 
GTC 


CTG 
GAC 


GCA 
CGT 


GGC 
CCG 


TGC 
ACG 


TTG 
AAC 


AGC 
TCG 


CAA 
GTT 


CTC 
GAG 


CAT 
GTA 


AGC 
TCG 


GGC 
CCG 


CTT 
GAA 


TTC 
AAG 


CTC 
GAG 


TAC 
ATG 


CAG 
GTC 


GGG 
CCC 


CTC 
GAG 


CTG 
GAC 


CAG 
GTC 


GCC 
CGG 


CTG 
GAC 


AAC 
TTG 


GGG 
CCC 


ACC 
TGG 


TCC 
AGG 


CCC 
GGG 


GAG 
CTC 


TTG 
AAC 


GGT 
CCA 


CCC 
GGG 


ACC 
TGG 


TTG 
AAC 


GAC 
CTG 


ACA 
TGT 


CTG 
GAC 


CAG 
GTC 


CTG 
GAC 


GAC 
CTG 


GTC 
CAG 


GCC 
CGG 


GAC 
CTG 


TTT 
AAA 


GCC 
CGG 


ACC 
TGG 


ACC 
TGG 


ATC 
TAG 


TGG 
ACC 


CAG 
GTC 


CAG 
GTC 


ATG 
TAC 


GAA 
CTT 


GAA 
CTT 


CTG 
GAC 


GGA 
CCT 


ATG 
TAC 


GCC 
CGG 


CCT 
GGA 
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GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

i) SEQ ID NO: 10 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA TTG TGG TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
■AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC AAC CAG ACC GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TTG GTC TGG CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

j) SEQ ID NO: 11 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA TTG TGG TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 
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CCC 

GGG 


AGC 

TCG 


CAG 


GCC 


CTG 
GAC 


CAG 

GTC 


CTG 
GAC 


GCA 
CGT 


GGC 
CCG 

Xa* T—J 


TGC 
ACG 


TTG 
AAC 


AGC 
TCG 


CAA 
GTT 


CTC 
GAG 


CAT 
GTA 


AGC 
TCG 


GGC 
CCG 


CTT 

/*• TV TV 

GAA 


TTC 

AAG 


CTC 


TAG 

ATf2 


CAG 


GGG 

CCC 


CTC 
GAG 


CTG 
GAC 


CAG 
GTC 


GCC 
CGG 


CTG 
GAC 


GAA 
CTT 


GGG 
CCC 


ATC 
TAG 


TCC 
AGG 


CCC 

^M- 

GGG 


GAG 
CTC 


TTG 
AAC 


GGT 
CCA 


CCC 


ACC 

1GG 


TTG 

B IP 

AAL 


GAC 

Liu 


ACA 

1G X 


CTG 
rap 

unL 


CAG 

PTP 


CTG 

ftAP 


GAC 

PTfl 


GTC 


GCC 

CGG 


GAC 
CTG 


TTT 

m *l *k 

AAA 


GCC 
CGG 


ACC 
TGG 


ACC 
TGG 


ATC 

TAG 


TGG 

ALL 


CAG 

vjIL 


CAG 

olL 


ATG 

XAL 


GAA 

LI 1 


GAA 

LI 1 


CTG 

CI AP 


GGA 

PPT 


ATG 

TAP 


GCC 

CGG 


CCT 
GGA 


GCC 
CGG 


CTG 
GAC 


CAG 
GTC 


CCC 
GGG 


ACC 
TGG 


CAG 
GTC 


GGT 
CCA 


GCC 
CGG 


ATG 
TAC 


CCG 
GGC 


GCC 
CGG 


TTC 
AAG 


AAC 
TTG 


TCT 
AGA 


ACC 
TGG 


TTC 
AAG 


CAG 
GTC 


CGC 
GCG 


CGG 
GCC 


GGA 
CGT 


GGA 
CCT 


GGG 
CCC 


GTC 
CAG 


CTG 
GAC 


GTT 
CAA 


GCC 
CGG 


TCC 
AGG 


CAT 
GTA 


CTG 
GAC 


CAG 
GTC 


AGC 
TCG 


TTC 
AAG 


CTG 
GAC 


GAG 
CTC 


GTG 
CAC 


TCG 
AGC 


TAC 
ATG 


CGC 
GCG 


GTC 
CAG 


TTA 
AAT 


AGG 
TCC 


CAC 
GTG 


CTT 
GAA 


GCC 
CGG 


CAG 
GTC 


CCC 
GGG 






k) 

ACC 

TGG 


SEQ 
CCC 
GGG 


ID NO: 12 
CTG GGC CCT 
GAC CCG GGA 


GCC 
CGG 


AGC 
TCG 


TCC 
AGG 


CTG 
GAC 


CCC 
GGG 


CAG 
GTC 


AGC 
TCG 


TTC 

TV T\ f—\ 

AAG 


CTG 

GAL 


CTC 

P7AP 


AAG 

TTP 
XXL 


GCC 
CGG' 


TTA 
AAT 


GAG 
CTC 


CAA 
GTT 


GTG 
CAC 


AGG 

M 

TCC 


AAG 
TTC 


ATC 
TAG 


CAG 
GTC 


GGC 
CCG 


GAT 

iv 

CTA 


GGC 
CCG 


GCA 
CGT 


GCG 

CGL 


CTC 

GAG 


CAG 

PTP 
LiL 


GAG 
CTC 


AAG 
TTC 


CTG 
GAC 


TGT 
ACA 


AAC 
TTG 


ACC 

■^B. M 

TGG 


ACC 

m a 

TGG 


AAG 
TTC 


CTG 
GAC 


TGC 

^^^^ 

ACG 


CAC 
GTG 


CCC 
GGG 


GAG 
CTC 


GAG 

LI L 


CTG 

GAL 


GTG 

LnL 


CTG 
GAC 


CTC 
GAG 


GGA 
CCT 


CAC 
GTG 


TCT 
ACA 


CTG 
GAC 


GGC 
CCG 


ATC 
TAG 


GTT 
CAA 


AAC 
TTG 


GCT 
CGA 


ACC 
TGG 


CTG 

GAC 


AGC 

1LG 


AGC 

X LG 


TGC 

APf3 
ALVj 


CCC 

GGG 


AGC 
TCG 


CAG 
GTC 


GCC 
CGG 


CTG 
GAC 


CAG 
GTC 


CTG 

GAC 


GCA 

CGT 


GGC 

CCG 


TGC 

ACG 


TTG 

AAC 


AGC 

1 Lb 


CAA 

PTT 

GJl x 


CTC 
nun. 


CAT 

HTA 

O X A 


AGC 

tph 


GGC 

.^^m .^^H. 

CCG 


CTT 
GAA 


TTC 
AAG 


CTC 
GAG 


TAC 
ATG 


CAG 

GTC 


GGG 

CLL 


CTC 

f T\ p* 

GAG 


CTG 
LjAL 


CAG 

ulL 


GCC 

pip*P* 

LGG 


CTG 

GAL 


GAA 

PTT 
L 1 1 


GGG 

CCC 


ATC 

TAG 


TCC 
AGG 


CCC 
GGG 


GAG 

/"ill i^i 

CTC 


TTG 
AAC 


GGT 
CCA 


CCC 
GGG 


ACC 

X OVj 


TTG 

A AP 


GAC 

PTfZ 


ACA 

X W X 


CTG 

HAP 

WAV* 


CAG 
fiTP 

\D X \m» 


CTG 


GAC 
CTG 


GTC 
CAG 


GCC 
CGG 


GAC 
CTG 


TTT 

AAA 


GCC 

CGG 


ACC 
TGG 


ACC 
TGG 


ATC 
TAG 


TGG 
ACC 


CAG 
GTC 


CAG 
GTC 


ATG 
TAC 


GAA 
CTT 


GAA 
CTT 


CTG 
GAC 


GGA 
CCT 


ATG 
TAC 


GCC 
CGG 


CCT 
GGA 


GCC 
CGG 


CTG 
GAC 


CAG 
GTC 


CCC 
GGG 


ACC 
TGG 


CAG 
GTC 


GGT 
CCA 


GCC 
CGG 


• ATG 
TAC 


CCG 
GGC 


GCC 
CGG 


TTC 
AAG 


GCC 
CGG 


TCT 
AGA 


GCT 
CGA 


TTC 
AAG 


CAG 
GTC 


CGC 
GCG 


CGG 
GCC 


GCA 
CGT 


GGA 
CCT 


GGG 
CCC 


GTC 
CAG 


CTG 
GAC 


GTT 
CAA 


GCC 
CGG 


TCC 
AGG 


CAT 
GTA 


CTG 
GAC 


CAG 
GTC 


AGC 
TCG 


TTC 
AAG 


CTG 
GAC 


GAG 
CTC 


GTG 
CAC 


TCG 
AGC 


TAC 
ATG 


CGC 
GCG 


GTC 
CAG 


TTA 
AAT 


AGG 
TCC 


CAC 
GTG 


CTT 
GAA 


GCC 
CGG 


CAG 
GTC 


CCC 
GGG 






1) ! 

ACC 
TGG 


SEQ : 
CCC 
GGG 


ID NO: 13 
CTG GGC 
GAC CCG 


CCT 
GGA 


GCC 
CGG 


AGC 
TCG 


TCC 
AGG 


CTG 
GAC 


CCC 
GGG 


CAG 
GTC 


AGC 
TCG 


TTC 
AAG 


"CTG 
GAC 


CTC 
GAG 


AAG 
TTC 
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GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 

CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 

CTC TTC GAC ACA TTG TGG TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 

GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC AAC GCC ACC CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 

GGG TCG TTG CGG TGG GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 

CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 

GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 

AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 

CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 

GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 

GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

m) SEQ ID NO: 14 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 

TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 

CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 

CTC TTC GAC ACA TTG TGG TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 

GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 

GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 

CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

AAC GGT ACC GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 

TTG CCA TGG CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 

AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 



GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
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CGG GAC GTC GGG TGG GTC CCA CGG 

CAG CGC CGG GCA GGA GGG GTC CTG 
GTC GCG GCC CGT CCT CCC CAG GAC 

CTG GAG GTG TCG TAC CGC GTC TTA 
GAC CTC CAC AGC ATG GCG CAG AAT 

n) SEQ ID NO: 15 

ACC CCC CTG GGC CCT GCC AGC TCC 
TGG GGG GAC CCG GGA CGG TCG AGG 

GCC TTA GAG CAA GTG AGG AAG ATC 
CGG AAT CTC GTT CAC TCC TTC TAG 

GAG AAG CTG TGT AAC ACC ACC AAG 
CTC TTC GAC ACA TTG TGG TGG TTC 

CTG CTC GGA CAC TCT CTG GGC ATC 
GAC GAG CCT GTG ACA GAC CCG TAG 

CCC AGC AAC GCC ACC CAG CTG GCA 
GGG TCG TTG CGG TGG GTC GAC CGT 

GGC CTT TTC CTC TAC CAG GGG CTC 
CCG GAA AAG GAG ATG GTC CCC GAG 

CCC GAG TTG GGT CCC ACC TTG GAC 
GGG CTC AAC CCA GGG TGG AAC CTG 

TTT GCC ACC ACC ATC TGG CAG CAG 
AAA CGG TGG TGG TAG ACC GTC GTC 

GCC CTG CAG CCC ACC CAG GGT GCC 
CGG GAC GTC GGG TGG GTC CCA CGG 

CAG CGC CGG GCA GGA GGG GTC CTG 
GTC GCG GCC CGT CCT CCC CAG GAC 

CTG GAG GTG TCG TAC CGC GTC TTA 
GAC CTC CAC AGC ATG GCG CAG AAT 

o) SEQ ID NO: 16 

ACC CCC CTG GGC CCT GCC AGC TCC 
TGG GGG GAC CCG GGA CGG TCG AGG 

GCC TTA GAG CAA GTG AGG AAG ATC 
CGG AAT CTC GTT CAC TCC TTC TAG 

GAG AAG CTG TGT AAC ACC ACC AAG 
CTC TTC GAC ACA TTG TGG TGG TTC 

CTG CTC GGA CAC TCT CTG GGC ATC 
GAC GAG CCT GTG ACA GAC CCG TAG 

ACC AGC CAG GCC CTG CAG CTG GCA 
TGG TCG GTC CGG GAC GTC GAC CGT 
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TAC GGC CGG AAG CGG AGA CGA AAG 

GTT GCC TCC CAT CTG CAG AGC TTC 
CAA CGG AGG GTA GAC GTC TCG AAG 

AGG CAC CTT GCC CAG CCC 
TCC GTG GAA CGG GTC GGG 



CTG CCC CAG AGC TTC CTG CTC AAG 
GAC GGG GTC TCG AAG GAC GAG TTC 

CAG GGC GAT GGC GCA GCG CTC CAG 
GTC CCG CTA CCG CGT CGC GAG GTC 

CTG TGC CAC CCC GAG GAG CTG GTG 
GAC ACG GTG GGG CTC CTC GAC CAC 

GTT AAC GCT ACC CTG AGC AGC TGC 
CAA TTG CGA TGG GAC TCG TCG ACG 

GGC TGC TTG AGC CAA CTC CAT AGC 
CCG ACG AAC TCG GTT GAG GTA TCG 

CTG CAG GCC CTG GAA GGG ATC TCC 
GAC GTC CGG GAC CTT CCC TAG AGG 

ACA CTG CAG CTG GAC GTC GCC GAC 
TGT GAC GTC GAC CTG CAG CGG CTG 

ATG GAA GAA CTG GGA ATG GCC CCT 
TAC CTT CTT GAC CCT TAC CGG GGA 

ATG CCG GCC TTC GCC TCT GCT TTC 
TAC GGC CGG AAG CGG AGA CGA AAG 

GTT GCC TCC CAT CTG CAG AGC TTC 
CAA CGG AGG GTA GAC GTC TCG AAG 

AGG CAC CTT GCC CAG CCC 
TCC GTG GAA CGG GTC GGG 



CTG CCC CAG AGC TTC CTG CTC AAG 
GAC GGG GTC TCG AAG GAC GAG TTC 

CAG GGC GAT GGC GCA GCG CTC CAG 
GTC CCG CTA CCG CGT CGC GAG GTC 

CTG TGC CAC CCC GAG GAG CTG GTG 
GAC ACG GTG GGG CTC CTC GAC CAC 

CCC TGG GCT CCC CTG AGC AAT TGC 
GGG ACC CGA GGG GAC TCG TTA ACG 

GGC TGC TTG AGC CAA CTC CAT AGC 
CCG ACG AAC TCG GTT GAG GTA TCG 
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A hyperglcosylated heterologous fusion protein of the 
present invention also includes polynucleotides encoding the 
heterologous fusion protein described herein, vectors 
comprising these polynucleotides and host cells transfected 
or transformed with the vectors described herein. Also 
included is a process for producing a heterologous fusion 
protein comprising the steps of transcribing and translating 
a polynucleotide described herein under conditions wherein 
the heterologous fusion protein is expressed in detectable 
amount s . 

The present invention encompasses a method for 
increasing neutrophil levels in a mammal comprising the 
administration of a therapeutically effective amount of a 
heterologous fusion protein described above. The present 
invention also includes the use of the heterologous fusion 
proteins described above for the manufacture of a medicament 
for the treatment of patients with insufficient circulating 
neutrophil levels. 

The present invention also encompasses a pharmaceutical 
formulation adapted for the treatment of patients with 
insufficient neutrophil levels comprising a glycosylated 
protein as described above. 
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BRIEF DESCRIPTION OF THE FIGURES 



The invention is further illustrated with reference to 

the following drawings: 

Figure 1: Schematic illustrating .fourteen regions in 
human G-CSF wherein the amino acid sequence can be mutated 
to create .functional glycosylation sites. 

Figure 2a: IgGl Fc amino acid sequence encompassing 
the hinge region, CH2 and CH3 domains. 

Figure 2b: IgG4 Fc amino acid sequence encompassing 
the hinge region, CH2 and CH3 domains. 

Human serum albumin amino acid sequence 



Figure 3 
Figure 4 
Figure 5 



IgGl Fc DNA sequence 

IgG4 Fc DNA sequence with Ser229Pro 



mutation. 



Figure 6 
Figure 7 
Figure 8 



G-CSF/IgGl Fc fusion protein 
G-CSF/IgG4 Fc fusion protein 
G-CSF/HA fusion protein. 



The present invention comprises a heterologous fusion 
protein. As used herein, the term heterologous fusion 
protein means a hyperglycosylated G-CSF analog fused to 
human albumin, a human albumin analog, a human albumin 
fragment, the Fc portion of an immunoglobulin, an analog of 
the Fc portion of an immunoglobulin, or a fragment of the Fc 
portion of an immunoglobulin. The G-CSF analog may be fused 
directly, or fused via a peptide linker, to an albumin or Fc 
protein. The albumin and Fc portion may be fused to the G- 
CSF analogs at either terminus or at both termini. These 
heterologous fusion proteins are biologically active and 
have an increased half -life compared to native G-CSF. 



Hyperglycosylated G-CSF Analogs 

Encompassed by the invention are certain 
hyperglycosylated analogs of G-CSF. Analogs of G-CSF refer 
to human G-CSF with one or more changes in the amino acid 



t 
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sequence which result in an increase in the number of sites 
for carbohydrate attachment compared with native human G-CSF 
expressed in animal cells in vivo. In addition, G-CSF 
analogs include human G-CSF wherein the 0-linked 
glycosylation site at position 133 is replaced with an N- 
linked glycosylation site. Analogs are generated by site 
directed mutagenesis having substitution of amino acid 
residues creating new sites that are available for 
glycosylation. Analogs having a greater carbohydrate 
content than that found in native human G-CSF are generated 
by adding glycosylation sites that do not perturb the 
secondary, tertiary, and quaternary structure required for 
activity. Furthermore, because the hyperglycosylated 
analogs of the present invention have a larger mass and an 
increased negative charge compared to native G-CSF, they 
will not be as rapidly cleared from the circulation. 

It is preferred that the G-CSF analog have 1, 2, 3, or 
4 additional sites for N-glycosylation . Figure 1 
illustrates fourteen different regions that can be 
glycosylated with very little effect on in vitro activity. 
Each region may be mutated to the consensus site for N- 
glycosylation addition which is Asn XI X2 wherein XI is any 
amino acid except Pro and X2 is Ser or Thr. It is preferred 
that the XI amino acid be any other amino acid except Trp, 
Asp, Glu, or Leu and it is most preferred that the XI amino 
acid be the naturally occurring amino acid. The scope of 
the present invention includes analogs wherein a single 
region (1 through 14) is mutated or wherein a region is 
mutated in combination with one or more other regions. 

Analogs having carbohydrate attached to only a single 
mutated site have been expressed, purified, characterized, 
and tested for activity. Similarly analogs with multiple 
glycosylation sites have been expressed, purified, 
characterized, and tested for activity. For example G- 
CSF[A37N, Y3 9T] is G-CSF wherein the amino acids at positions 
3 7 and 3 9 have been substituted to create a glycosylation 
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site- This site of carbohydrate attachment is illustrated 
as region 1 in Figure 1. G-CSF [A37N, Y39T, P57V, W58N, P60T] is 
an example of a G-CSF analog wherein amino acids in region 1 
and region 2 are mutated to provide two functional 
glycosylation sites on a single molecule (Figure 1) . 

G-CSF[A37N, Y39T, P57V, W58N, P60T, Q67N, L69T] is an example 
of a G-CSF analog wherein the amino acids in region 1, 
region 2, and region 9 are mutated to provide three 
functional glycosylation sites on a single molecule (Figure 

1) - 

Native G-CSF can be used as the backbone to create the 
glycosylated G-CSF analogs of the present invention. In 
addition, the native G-CSF backbone used to create the 
analogs of the present invention can be modified such that 
substitutions in the regions defined in Figure 1 are made in 
the context of a different or improved G-CSF protein. For 
example, native G-CSF with a Cysteine to Alanine 
substitution at position 17 may reduce aggregation and 
enhance stability and thus, can be used as the backbone used 
to create the glycosylated G-CSF analogs of the present 
invention. 

In addition, Reidhaar-Olson et al . , through . alanine 
scanning mutagenesis, describe residues critical to, the 
activity of human G-CSF. [Reidhaar-Olson et al . (1996) 
Biochemistry 35:9034-9041; See also Young et al . (1997) 
Protein Science 6:1228-1236], Thus, the glycosylated 
analogs of the present invention can be modified by 
substituting amino acids outside the glycosylated regions 
described in Figure 1. 

As outlined above, amino acid substitutions in the 
fusion proteins of the present invention can be based on the 
relative similarity of the amino acid side-chain 
substituents, for example, their hydrophobicity, 
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hydrophilicity, charge, size, etc. Furthermore, 
substitutions can be made based on secondary structure 
propensity- For example, a helical amino acid can be 
replaced with an amino acid that would preserve the helical 
structure. Exemplary substitutions that take various of the 
foregoing characteristics into consideration in order to 
produce conservative amino acid changes resulting in silent 
changes within the present peptides, etc., can be selected 
from other members of the class to which the naturally 
occurring amino acid belongs. 

The present invention also encompasses G-CSF analogs 
wherein the O-linked glycosylation site at position 133 is 
mutated to serve as an N- linked glycosylation site. The N- 
linked carbohydrate will generally have a higher sialic acid 
content which will protect it from the rapid clearance 
mechanisms associated with native G-CSF. 

The functions of a carbohydrate chain greatly depends 
on the structure of the attached carbohydrate moiety. 
Typically compounds with a higher sialic acid content will 
have better stability and longer half-lives in vivo. The re- 
linked oligosaccharides contain sialic acid in both an a2,3 
and an oc2,6 linkage to galactose. [Takeuchi et al . (1988) J. 
Biol. Chem. 263:3657], Typically the sialic acid in the 
a2,3 linkage is added to galactose on the mannose al,6 
branch and the sialic acid in the CX2,6 linkage is added to 
the galactose on the mannose ctl,3 branch. The enzymes that 
add these sialic acids (P-galactoside a2,3 sialyltransf erase 
and p-galactoside a2,6 sialyltransf erase) are most efficeint 
at adding sialic acid to the mannose al,6 and mannose al,3 
branches respectively. 

Tetra-antennary N-linked oligosachharides most commonly 
provide four possible sites for sialic acid attachment while 
bi- and tri-antennary oligosaccharide chains, which can 
substitute for the tetra-antennary form at Asn-liked sites, 

■ * * * 

commonly have at most only two or three sialic acids 
attached. O-linked oligosaccharides commonly provide only 
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two sites for sialic acid attachement. Mammalian cell 
cultures can be screened for those cells that preferentially 
add teta-antennary chains to the G-CSF analogs of the 
present invention, thereby maximizing the number of sites 
for sialic acid attachment. Different types of mammalian 
cells also differ with respect to the transferase enzymes 
present and consequently the sialic acid content and type of 
oligosachharide attached at each site. One way to optimize 
the carbohydrate content for a given G-CSF analog is to 
express the analog in a cell line wherein an expression 
plasmid containing DNA encoding a specific sialyl 
transferase (e.g., Ct2,6 sialyltrasnf erase) is co-transf ected 
with the G-CSF analog expression plasmid. 
Alternatively a host cell line may be stably transfected 
with a sialyltransf erase cDNA and that host cell used to 
express the G-CSF analog of interest. Thus, it is 
preferable if the oligosaccharide structure and sialic acid 
content are optimized for each analog encompassed by the 
present invention. 



Heterologous Fc fusion proteins: 

The hyperglycosylated G-CSF analogs described above can 
be fused directly or via a peptide linker to the Fc portion 
of an immunoglobulin. (See Figures 6-7) . 

Immunoglobulins are molecules containing polypeptide 
chains held together by disulfide bonds, typically having 
two light chains and two heavy chains. In each chain, one 
domain (V) has a variable amino acid sequence depending on 
the antibody specificity of the molecule. The other domains 
(C) have a rather constant sequence common to molecules of 

the same class. 

As used herein, the Fc portion of an immunoglobulin has 

the meaning commonly given to the term in the field of 

immunology. Specifically, this term refers to an antibody 

fragment which is obtained by removing the two antigen 

binding regions (the Fab fragments) from the antibody. 
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Thus, the Fc portion is formed from approximately equal 
sized fragments of the constant region from both heavy 
chains, which associate through non-covalent interactions 
and disulfide bonds. The Fc portion can include the hinge 
regions and extend through the CH2 and CH3 domains to the C- 
terminus of the antibody. Representative hinge regions for 
human and mouse immunoglobulins can be found in Antibody 
Engineering, A Practical Guide, Borrebaeck, C.A.K., ed., 
W.H. Freeman and Co., 1992, the teachings of which are 
herein incorporated by reference. The amino acid sequence 
of a representative Fc protein containing a hinge region, 
CH2 and CH3 domains is shown in Figures 2a and 2b. 

There are five types of human immunoglobulin Fc regions 
with different effector and pharmacokinetic properties: IgG, 
IgA, IgM, IgD, and IgE. IgG is the most abundant 
immunoglobulin in serum. IgG also has the longest half -life 
in serum of any immunoglobulin (23 days) . Unlike other 
immunoglobulins, IgG is efficiently recirculated following 
binding to an Fc receptor. There are four IgG subclasses 
Gl, G2, G3 , and G4 , each of which have different effector 
functions. Gl, G2, and G3 can bind Clq and fix complement 
while G4 cannot. Even though G3 is able to bind Clq more 
efficiently than Gl, Gl is more effective at mediating 
complement-directed cell lysis. G2 fixes complement very 
inefficiently. The Clq binding site in IgG is located at 
the carboxy terminal region of the CH2 domain. 

All IgG subclasses are capable of binding to Fc 
receptors (CD16, CD32, CD64) with Gl and G3 being more 
effective than G2 and G4 . The Fc receptor-binding region of 
IgG is formed by residues located in both the hinge and the 
carboxy terminal regions of the CH2 domain. 

IgA can exist both in a monomeric and dimeric form held 
together by a J-chain. IgA is the second most abundant Ig 
in serum, but it has a half-life of only 6 days. IgA has 
three effector functions. It binds to an IgA specific 
receptor on macrophages and eosinophils, which drives 
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phagocytosis and degranulation, respectively. It can also 
fix complement via an unknown alternative pathway. 

IgM is expressed as either a pentamer or a hexamer, 
both of which are held together by a J-chain. IgM has a 
serum half -life of 5 days. It binds weakly to Clq via a 
binding site located in its CH3 domain. IgD has a half -life 
of 3 days in serum. It is unclear what effector functions 
are attributable to this Ig. IgE is a monomeric Ig and has 
a serum half -life of 2.5 days. IgE binds to two Fc 
receptors which drives degranulation and results in the 
release of proinflammatory agents. 

Depending on the desired in vivo effect, the heterologous 
fusion proteins of the present invention may contain any of 
the isotypes described above or may contain mutated Fc 
regions wherein the complement and/or Fc receptor binding 
functions have been altered. For example, one embodiment of 
the present invention is a Ser229Pro mutation in IgG4 Fc, 
which reduces monomer formation. See Figure 5. 

The heterologous fusion proteins of the present 
invention may contain the entire Fc portion of an 
immunoglobulin, fragments of the Fc portion of an 
immunoglobulin, or analogs thereof fused to a G-CSF analog. 
Furthermore, the Fc portion may be fused at either terminus 

or at both termini. 

The heterologous fusion proteins of the present 
invention can consist of single chain proteins or as multi- 
chain polypeptides. Two or more Fc fusion proteins can be 
produced such that they interact through disulfide bonds 
that naturally form between Fc regions. These multimers can 
be homogeneous with respect to the G-CSF analog or they may 
contain different G-CSF analogs fused at the N- terminus of 
the Fc portion of the fusion protein. 

Regardless of the final structure of the fusion 
protein, the Fc or Fc-like region must serve to prolong the 
in vivo plasma half-life of the G-CSF analog compared to the 
native G-CSF. Furthermore, the fused G-CSF analog must 
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retain some biological activity. Biological activity can be 
determined by in vitro and in vivo methods known in the art. 

Since the Fc region of IgG produced by proteolysis has 
the same in vivo half-life as the intact IgG molecule and 
Fab fragments are rapidly degraded, it is believed that the 
relevant sequence for prolonging half-life resides in the 
CH2 and/or CH3 domains. Further, it has been shown in the 
literature that the catabolic rates of IgG variants that do 
not bind the high-affinity Fc receptor or Clq are 
indistinguishable from the rate of clearance of the parent 
wild-type antibody, indicating that the catabolic site is 
distinct from the sites involved in Fc receptor or Clq 
binding. [Wawrzynczak et al . , (1992) Molecular Immunology 
29:221], Site-directed mutagenesis studies using a murine 
IgGl Fc region suggested that the site of the IgGl Fc region 
that controls the catabolic rate is located at the CH2-CH3 
domain interface. 

Based on these studies, Fc. regions can be modified at 
the catabolic site to optimize the half-life of the fusion 
proteins. It is preferable that the Fc region used for the 
heterologous fusion proteins of the present invention be 
derived from an IgGl (see Figure 4) or an IgG4 Fc region. 
It is even more preferable that the Fc region be IgG4 or 
derived from IgG4 . Preferably the IgG Fc region contains 
both the CH2 and CH3 regions including the hinge region. 

Heterologous albumin fusion proteins : 

The G-CSF analogs described above can be fused directly 
or via a peptide linker to albumin or an analog, fragment, 
or derivative thereof. (See Figure 8) . 

Generally the albumin proteins making up part of the 
fusion proteins of the present invention can be derived from 
albumin cloned from any species. However, human albumin and 
fragments and analogs thereof are preferred to reduce the 
risk of the fusion protein being immunogenic in humans. 
Human serum albumin (HA) consists of a single non- 
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glycosylated polypeptide chain of 585 amino acids with a 
formula molecular weight of 66,500. The amino ' acid sequence 
of human HA is shown in Figure 3. [See Meloun, et al. 
(1975) FEBS Letters 58:136; Behrens, et al . (1975) Fed. 
Proc. 34:591; Lawn, -et al. (1981) Nucleic Acids Research 
9:6102-6114; Minghetti, et al . (1986) J. Biol. Chem. 
261:6747] . A variety of polymorphic variants as well as 
analogs and fragments of albumin have been described. [See 
Weitkamp, et al., (1973) Ann. Hum. Genet. 37:219]. For 
example, in EP 322,094, the inventors disclose various 
shorter forms of HA. Some of these fragments include HA(1- 
373), HA(l-388), HA(l-389), HA(l-369), andHA(l-419) and 
fragments between 1-369 and 1-419. EP 399,666 discloses 
albumin fragments that include HA (1-177) and HA (1-200) and 
fragments between HA(1-177) and HA(l-200) . 

It is understood that the heterologous fusion proteins 
of the present invention include G-CSF analogs that are 
coupled to any albumin protein including fragments, analogs, 
and derivatives wherein such fusion protein is biologically 
active and has a longer plasma half-life than the G-CSF 
analog alone. Thus, the albumin portion of the fusion 
protein need not necessarily have a plasma half-life equal 
to that of native human albumin. In addition, the albumin 
may be fused to either terminus or both termini of the 
hyperglycosylated G-CSF analog. Fragments, analogs, and 
derivatives are known or can be generated that have longer 
half-lives or have half-lives intermediate to that of native 
human albumin and the G-CSF analog of interest. 

The heterologous fusion proteins of the present 
invention encompass proteins having conservative amino acid 
substitutions in the G-CSF analog and/or the Fc or albumin 
portion of the fusion protein. A "conservative 
substitution" is the replacement of an amino acid with 
another amino acid that has the same net electronic charge 
and approximately the same size and shape. Amino acids with 
aliphatic or substituted aliphatic amino acid side chains 
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have approximately the same size when the total number 
carbon and heteroatoms in their side chains differs by no 
more than about four. They have approximately the same 
shape when the number of branches in their side chains 
differs by no more than one. Amino acids with phenyl or 
substituted phenyl groups in their side chains are 
considered to have about the same size and shape. Except as 
otherwise specifically provided herein, conservative 
substitutions are preferably made with naturally occurring 
amino acids. 

However, the term "amino acid" is used herein in its 
broadest sense, and includes naturally occurring amino acids 
as well as non-naturally occurring amino acids, including 
amino acid analogs and derivatives. The latter includes 
molecules containing an amino acid moiety. One skilled in 
the art will recognize, in view of this broad definition, 
that reference herein to an amino acid includes, for 
example, naturally occurring proteogenic L-amino acids; D- 
amino acids; chemically modified amino acids such as amino 
acid analogs and derivatives; naturally occurring non- 
proteogenic amino acids such as norleucine, P-alanine, 
ornithine, GAB A, etc.; and chemically synthesized compounds 
having properties known in the art to be characteristic of 
amino acids. As used herein, the term "proteogenic" 
indicates that the amino acid can be incorporated into a 
peptide, polypeptide, or protein in a cell through a 
metabolic pathway. 

The incorporation of non-natural amino acids, including 
synthetic non-native amino acids, substituted amino acids, 
or one or more D-amino acids into the heterologous fusion 
proteins of the present invention can be advantageous in a 
number of different ways. D-amino acid-containing peptides, 
etc., exhibit increased stability in vitro or in vivo 
compared to L-amino acid-containing counterparts. Thus, the 
construction of peptides, etc., incorporating D-amino acids 
can be particularly useful when greater intracellular 
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stability is desired or required. More specifically, D- 
peptides, etc., are resistant to endogenous peptidases and 
proteases, thereby providing improved bioavailability of the 
molecule, and prolonged lifetimes in vivo when such 
properties are desirable. Additionally, D-peptides, etc., 
cannot be processed efficiently for major histocompatibility 
complex class II-restricted presentation to T helper cells, 
and are therefore less likely to induce humoral immune 
responses in the whole organism. 

General methods for making the heterologous fusion proteins 
of the present invention . 

Although the heterologous fusion proteins of the 
present invention can be made by a variety of different 
methods, recombinant methods are preferred. For purposes of 
the present invention, as disclosed and claimed herein, the 
following general molecular biology terms and abbreviations 
are defined below. The terms and abbreviations used in this 
document have their normal meanings unless otherwise 
designated. For example, "°C" refers to degrees Celsius; 
"mmol" refers to millimole or millimoles; "mg" refers to 
milligrams ; "(Ltg" refers to micrograms; "ml or mL" refers to 

to 1 ** * 

milliliters; and w jll or (iL"|'refers to microliters. Amino 
acids abbreviations are as set forth in 37 C.F.R. § 1.822 

(b) (2) (1994) . 

"Base pair" or u bp" as used herein refers to DNA or 
RNA. The abbreviations A,C,G, and T correspond to the 5'- 
monophosphate forms of the deoxyribonucleosides 
(deoxy ) adenosine , (deoxy ) cyt idine , (deoxy ) guanosine , and 
thymidine, respectively, when they occur in DNA molecules. 
The abbreviations U,C,G, and A correspond to the 5'- 
monophosphate forms of the ribonucleosides uridine, 
cytiHine, guanosine, and adenosine, respectively when they 
occur in RNA molecules. In double stranded DNA, base pair 
may refer to a partnership of A with T or C with G. In a 
DNA/RNA, heteroduplex base pair may refer to a partnership 
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of A with U or C with G. (See the definition of 
"complementary" , infra , ) 

"Digestion" or "Restriction" of DNA refers to the 
catalytic cleavage of the DNA with a restriction enzyme that 
acts only at certain sequences in the DNA ("sequence- 
specific endonucleases" ) . The various restriction enzymes 
used herein are commercially available and their reaction 
conditions, cof actors, and other requirements were used as 
would be known to one of ordinary skill in the art. 
Appropriate buffers and substrate amounts for particular 
restriction enzymes are specified by the manufacturer or can 
be readily found in the literature. 

"Ligation" refers to the process of forming 
phosphodiester bonds between two double stranded nucleic 
acid fragments. Unless otherwise provided, ligation may be 
accomplished using known buffers and conditions with a DNA 
ligase, such as T4 DNA ligase. 

"Plasmid" refers to an extrachromosomal (usually) self- 
replicating genetic element. Plasmids are generally 
designated by a lower case "p" followed by letters and/or 
numbers. The starting plasmids herein are either 
commercially available, publicly available on an 
unrestricted basis, or can be constructed from available 
plasmids in accordance with published procedures. In 
addition, equivalent plasmids to those described are known 
in the art and will be apparent to the ordinarily skilled 
artisan. 

"Recombinant DNA cloning vector" as used herein refers 
to any autonomously replicating agent, including, but not 
limited to, plasmids and phages, comprising a DNA molecule 
to which one or more additional DNA segments can or have 
been added. 

* 

"Recombinant DNA expression vector" as used herein 
refers to any recombinant DNA cloning vector in which a 
promoter to control transcription of the inserted DNA has 
been incorporated. 



* 
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"Transcription" refers to the process whereby 
information contained in a nucleotide sequence of DNA is 
transferred to a complementary RNA sequence. 

"Transf ection" refers to the uptake of an expression 
vector by a host cell whether or not any coding sequences 
are, in fact, expressed. Numerous .methods of transf ection 
are known to the ordinarily skilled artisan, for example, 
calcium phosphate co-precipitation, liposome transf ection, 
and electroporation. Successful transfection is generally 
recognized when any indication of the operation of this 
vector occurs within the host cell. 

"Transformation" refers to the introduction of DNA into 
an organism so that the DNA is replicable, either as an 
extrachromosomal element or by chromosomal integration. 
Methods of transforming bacterial and eukaryotic hosts are 
well known in the art, many of which methods, such as 
nuclear injection, protoplast fusion or by calcium treatment 
using calcium chloride are summarized in J. Sambrook, et 
al., Molecular Cloning: A Laboratory Manual, (1989). 
Generally, when introducing DNA into Yeast the term 
transformation is used as opposed to the term transfection. 

"Translation" as used herein refers to the process 
whereby the genetic information of messenger RNA (mRNA) is 
used to specify and direct the synthesis of a polypeptide 
chain. 

"Vector" refers to a nucleic acid compound used for the 
transfection and/or transformation of cells in gene 
manipulation bearing polynucleotide sequences corresponding 
to appropriate protein molecules which, when combined with 
appropriate control sequences, confers specific properties 
on the host cell to be transfected and/or transformed. 
Plasmids, viruses, and bacteriophage are suitable vectors. 
Artificial vectors are constructed by cutting and joining 
DNA molecules from different sources using restriction 
enzymes and ligases. The term "vector" as used herein 
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includes Recombinant DNA cloning vectors and Recombinant DNA 
expression vectors. 

"Complementary" or "Complementarity", as used herein, 
refers to pairs of bases (purines and pyrimidines) that 
associate through hydrogen bonding in a double stranded 
nucleic acid. The following base pairs are complementary: 
guanine and cytosine; adenine and thymine; and adenine and 
uracil . 

"Hybridization" as used herein refers to a process in 
which a strand of nucleic acid joins with a complementary 
strand through base pairing. The conditions employed in the 
hybridization of two non- identical , but very similar, 
complementary nucleic acids varies with the degree of 
complementarity of the two strands and the length of the 
strands. Such techniques and conditions are well known to 
practitioners in this field. 

"Isolated amino acid sequence" refers to any amino acid 
sequence, however, constructed or synthesized, which is 
locationally distinct from the naturally occurring sequence. 

"Isolated DNA compound" refers to any DNA sequence, 
however constructed or synthesized, which is locationally 
distinct from its natural location in genomic DNA. 

"Isolated nucleic acid compound" refers to any RNA or 
DNA sequence, however constructed or synthesized, which is 
locationally distinct from its natural location. 

* 

"Primer" refers to a nucleic acid fragment which 
functions as an initiating substrate for enzymatic or 
synthetic elongation. 

"Promoter" refers to a DNA sequence which directs 
transcription of DNA to RNA. 

"Probe" refers to a nucleic acid compound or a 
fragment, thereof, which hybridizes with another nucleic 
acid compound. 

"Stringency" of hybridization reactions is readily 
determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe 
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length, washing temperature, and salt concentration. In 
general, longer probes require higher temperatures for proper 
annealing, while short probes need lower temperatures. 
Hybridization generally depends on the ability of denatured 
DNA to re-anneal when complementary strands are present in an 
environment below their melting temperature. The higher the 
degree of desired homology between the probe and hybridizable 
sequence, the higher the relative temperature that can be 
used. As a result, it follows that higher relative 
temperatures would tend to make the reactions more stringent, 
while lower temperatures less so. For additional details and 
explanation of stringency of hybridization reactions, see 
Ausubel et al . , Current Protocols in Molecular Biology, Wiley 
Interscience Publishers, 1995. 

"Stringent conditions" or "high stringency conditions", 
as defined herein, may be identified by those that (1) employ 
low ionic strength and high temperature for washing, for 
example, 15 mM sodium chloride/1.5 mM sodium citrate/0.1% 
sodium dodecyl sulfate at 50°C; (2) employ during 
hybridization a denaturing agent, such as formamide, for 
example, 50% (v/v) formamide with 0.1% bovine serum 
albumin/0.1% ficoll/0.1% polyvinylpyrrolidone/50 mM sodium 
phosphate buffer at pH 6.5 with 750 mM sodium chloride/75 mM 
sodium citrate at 42 °C; or (3) employ 50% formamide, 5X SSC 
(750 mM sodium chloride, 75 mM sodium citrate) , 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5X Denhardt's 
solution, sonicated salmon sperm DNA (50 yg/ml) , 0.1% SDS, and 
10% dextran sulfate at 42°C with washes at 42°C in 0.2X SSC 
(30 mM sodium chloride/3 mM sodium citrate) and 50% formamide 
at 55°C, followed by a high-stringency wash consisting of 0 . IX 
SSC containing EDTA at 55°C. 

"Moderately stringent conditions" may be identified as 
described by Sambrook et al . [Molecular Cloning: A Laboratory 
Manual, New York: Cold Spring Harbor Press, (1989)], and 
include the use of washing solution and hybridization 
conditions (e.g., temperature, ionic strength, and %SDS) less 
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stringent than those described above. An example of 
moderately stringent conditions is overnight incubation at 
37°C in a solution comprising: 20% formamide, 5X SSC (750 mM 
sodium chloride, 75 mM sodium citrate), 50 mM sodium phosphate 
at pH 7.6, 5X Denhardt's solution, 10% dextran sulfate, and 20 
mg/mL denatured sheared salmon sperm DNA, followed by washing 
the filters in IX SSC at about 37-50°C. The skilled artisan 
will recognize how to adjust the temperature, ionic strength, 
etc., as necessary to accommodate factors such as probe length 
and the like. 

"PCR" refers to the widely-known polymerase chain 
reaction employing a thermally- stable DNA polymerase. 

"Leader sequence" refers to a sequence of amino acids 
which can be enzymatically or chemically removed to produce 
the desired polypeptide of interest. 

"Secretion signal sequence" refers to a sequence of 
amino acids generally present at the N-terminal region of a 
larger polypeptide functioning to initiate association of 
that polypeptide with the cell membrane and secretion of 
that polypeptide through the cell membrane. 

Construction of DNA encoding the heterologous fusion 
proteins of the present invention: 

Wild type albumin and immunoglobulin proteins can be 
obtained from a variety of sources. For example, these 
proteins can be obtained from a cDNA library prepared from 
tissue or cells which express the mRNA of interest at a 
detectable level. Libraries can be screened with probes 
designed using the published DNA or protein sequence for the 
particular protein of interest. 

Screening a cDNA or genomic library with the selected 
probe may be conducted using standard procedures, such as 
described in Sambrook et al . , Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, NY 
(1989) . An alternative means to isolate a gene encoding an 
albumin or immunoglobulin protein is to use PCR methodology 
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[Sambrook et al . , supra; Dieffenbach et al . , PCR Primer: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, NY 
(1995)] . PCR primers can be designed based on published 
sequences . 

Generally the full-length wild-type sequences cloned 
from a particular species can serve as a template to create 
analogs, fragments, and derivatives that retain the ability 
to confer a longer plasma half-life on the G-CSF analog that 
is part of the fusion protein. It is preferred that the Fc 
and albumin portions of the heterologous fusion proteins of 
the present invention be derived from the native human 
sequence in order to reduce the risk of potential 
immunogenicity of the fusion protein in humans. 

In particular, it is preferred that the immunoglobulin 
portion of a fusion protein encompassed by the present 
invention contain only an Fc fragment of the immunoglobulin. 
Depending on whether particular effector functions are 
desired and the structural characteristics of the fusion 
protein, an Fc fragment may contain the hinge region along 
with the CH2 and CH3 domains or some other combination 
thereof. These Fc fragments can be generated using PCR 
techniques with primers designed to hybridize to sequences 
corresponding to the desired ends of the fragment. 
Similarly, if fragments of albumin are desired, PCR primers 
can be designed which are complementary to internal albumin 
sequences. PCR primers can also be designed to create 
restriction enzyme sites to facilitate cloning into 
expression vectors. 

DNA encoding human G-CSF can be obtained from a cDNA 
library prepared from tissue or cells which express G-CSF 
mRNA at a detectable level such as monocytes, macrophages, 
vascular endothelial cells, fibroblasts, and some human 
malignant and leukemic myeloblastic cells. Libraries can be 
screened with probes designed using the published DNA 
sequence for human G-CSF. [Souza L. et al . (1986) Science 
232:61-65]. Screening a cDNA or genomic library with the 
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selected probe may be conducted using standard procedures, 
such as described in Sambrook et al . , Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, NY 
(1989) . An alternative means to isolate the gene encoding 
human G-CSF is to use PCR methodology [Sambrook et al., 
supra; Dieffenbach et al . , PCR Primer: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, NY (1995)] . 

The glycosylated G-CSF analogs of the present invention 
can be constructed by a variety of mutagenesis techniques 
well known in the art. Specifically, a representative 
number of glycosylated G-CSF analogs were constructed using 
mutagenic PCR from a cloned wild-type human G-CSF DNA 
template (Example 1) . 

The glycosylated G-CSF analogs of the present invention 
may be produced by other methods including recombinant DNA 
technology or well known chemical procedures, such as solution 
or solid-phase peptide synthesis, or semi-synthesis in solution 
beginning with protein fragments coupled through conventional 
solution methods. 

Recombinant DNA methods are preferred for producing the 
glycosylated G-CSF analogs of the present invention. Host 
cells are transfected or transformed with expression or 
cloning vectors described herein for glycosylated G-CSF 
analog production and cultured in conventional nutrient 
media modified as appropriate for inducing promoters, 
selecting transf ormants , or amplifying the genes encoding 
the desired sequences (Example 2) . The culture conditions, 
such as media, temperature, pH and the like, can be selected 
by the skilled artisan without undue experimentation 

Physical stability is an essential feature for 
therapeutic formulations. The physical stability of the 
heterologous fusion proteins of the present invention 
depends on their conformational stability, the number of 
charged residues (pi of the protein) , the ionic strength and 
pH of the formulation, and the protein concentration, among 
other possible factors. As discussed previously, the G-CSF 
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analog portion of the heterologous fusion proteins can be 
successfully glycosylated and expressed such that they 
maintain their three dimensional structure. Because these 
analogs are able to fold properly in a hyperglycosylated 
state, they will have improved conformational and physical 
stability relative to wild-type G-CSF. 

While wild-type G-CSF produced in mammalian cells and 
bacterial cells has similar activity in vivo, the mammalian 
cell -produced protein has increased conformational and 
physical stability due to the presence of a single O-linked 
sugar moiety present at position 133. Thus, the G-CSF 
analog portion of the heterologus fusion proteins, which 
have an increased glycosylation content compared to wild- 
type G-CSF produced in mammalian or bacterial cells, will 
have increased stability. Furthermore, it is likely that 
glycosylation may inhibit inter-domain interactions and 
consequently enhance stability by preventing inter-domain 
disulfide shuffling. 

The gene encoding a heterologous fusion protein can be 
constructed by ligating DNA encoding a G-CSF analog in-frame 
to DNA encoding an albumin or Fc protein. The gene encoding 
the G-CSF analog and the gene encoding the albumin or Fc 
protein can also be joined in-frame via DNA encoding a 
linker peptide. 

The in vivo function and stability of the heterologous 
fusion proteins of the present invention can- be optimized by 
adding small peptide linkers to prevent potentially unwanted 
domain interactions. Although these linkers can potentially 
be any length and consist of any combination of amino acids, 
it is preferred that the length be no longer than necessary 
to prevent unwanted domain interactions and/or optimize 
biological activity and/or stability. Generally, the 
linkers should not contain amino acids with extremely bulky 
side chains or amino acids likely to introduce significant 
secondary structure. It is preferred that the linker be 
serine-glycine rich and be less than 30 amino acids in 
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length. It is more preferred that the linker be no more 
than 20 amino acids in length. It is even more preferred 
that the linker be no more than 15 amino acids in length. A 
preferred linker contains repeats of the sequence Gly-Gly- 
Gly-Gly-Ser. It is preferred that there be between 2 and 6 
repeats of this sequence. It is even more preferred that 
there be between 3 and 4 repeats of this sequence. 

To construct the heterolgous G-CSF fusion proteins, the 
DNA encoding wild-type G-CSF, albumin, and Fc polypeptides 
and fragments thereof can be mutated either before ligation 
or in the context of a cDNA encoding an entire fusion 
protein. A variety of mutagenesis techniques are well known 
in the art. For example, a mutagenic PCR method utilizes 
strand overlap extension to create specific base mutations 
for the purposes of changing a specific amino acid sequence 
in the corresponding protein. This PCR mutagenesis requires 
the use of four primers, two in the forward orientation 
(primers A and C) and two in the reverse orientation 
(primers B and D) . A mutated gene is amplified from the 
wild-type . template in two different stages. The first 
reaction amplifies the gene in halves by performing an A to 
B reaction and a separate C to D reaction wherein the B and 
C primers target the area of the gene to be mutated. When 
aligning these primers with the target area, they contain 
mismatches for the bases that are targeted to be changed. 
Once the A to B and C to D reactions are complete, the 
reaction products are isolated and mixed for use as the 
template for the A to D reaction. This reaction then yields 
the full, mutated product. 

Once a gene encoding an entire fusion protein is 
produced it can be cloned into an appropriate expression 
vector. Specific strategies that can be employed to make 
the G-CSF fusion proteins of the present invention are 
described in example 1. 
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General methods to recombinantly express the heterologous 
fusion proteins of the present invention: 

Host cells are transfected or transformed with expression 
or cloning vectors described herein for heterologous fusion 
protein production and cultured in conventional nutrient media 
modified as appropriate for inducing promoters, selecting 
■ transformants, or amplifying the genes encoding the desired 
sequences. The culture conditions, such as media, 
temperature, pH and the like, can be selected by the skilled 
artisan without undue experimentation. In general, 
principles, protocols, and practical techniques for maximizing 
the productivity of cell cultures can be found in Mammalian 
Cell Biotechnology: A Practical Approach, M. Butler, ed. (IRL 
Press, 1991) and Sambrook, et al . , supra. Methods of 
transfection are known to the ordinarily skilled artisan, for 
example, CaP0 4 and elect roporation. General aspects of 
mammalian cell host system transformations have been described 
in U.S. Patent No. 4,3 99,216. Transformations into yeast are 
typically carried out according to the method of van Solingen 
et al., J Bact. 130 (2): 946-7 (1977) and Hsiao et al . , Proc. 
Natl. Acad. Sci . USA 76(8): 3829-33 (1979). Suitable host 
cells for the expression of the fusion proteins of the present 
invention are derived from multicellular organisms. 

The fusion proteins of the present invention may be 
recombinantly produced directly, or as a protein having a 
signal sequence or other additional sequences which create a 
specific cleavage site at the N-terminus of the mature fusion 
protein. In general, the signal sequence may be a component 
of the vector, or it may be a part of the fusion protein- 
encoding DNA that is inserted into the vector. The signal 
sequence may be a prokaryotic signal sequence selected, for 
example, from the group of the alkaline phosphatase, 
penicillinase, lpp, or heat-stable enterotoxin II leaders. 
For yeast secretion the signal sequence may be, e.g., the 
yeast invertase leader, alpha factor leader (including 
Saccharomyces and Kluyveromyces cc-f actor leaders, the latter 
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describe'd in U.S. Patent No. 5,010,182), or acid phosphatase 
leader, the C. albicans glucoamylase leader (EP 362,179), or 
the signal described in WO 90/13646. In mammalian cell 
expression, mammalian signal sequences may be used to direct 
secretion of the protein, such as signal sequences from 
secreted polypeptides of the same or related species as well 
as viral secretory- leaders . 

Both expression and cloning vectors contain a nucleic 
acid sequence that enables the vector to replicate in one or 
more selected host cells. Such sequences are well known for a 
variety of bacteria, yeast, and viruses. The origin of 
replication from the plasmid pBR322 is suitable for most Gram- 
negative, bacteria, the 2u plasmid origin is suitable for 
yeast, and various viral origins (SV40, polyoma, adenovirus, 
VSV or BPV) are useful for cloning vectors in mammalian cells. 
Expression and cloning vectors will typically contain a 
selection gene, also termed a selectable marker. Typical 
selection genes encode proteins that (a) confer resistance to 
antibiotics or other toxins, e.g., ampicillin, neomycin, 
methotrexate, or tetracycline, (b) complement autotrophic 
deficiencies, or (c) supply critical nutrients not available 
from complex media, e.g., the gene encoding D-alanine racemase 
for Bacilli. 

An example of suitable selectable markers for mammalian 
cells are those that enable the identification of cells 
competent to take up the fusion protein-encoding nucleic acid, 
such as DHFR or thymidine kinase. An appropriate host cell 
when wild-type DHFR is employed is the CHO cell line deficient 
in DHFR activity, prepared and propagated as described [Urlaub 
and Chasin, Proc. Natl. Acad. Sci. USA, 77(7): 4216-20 
(1980)]. A suitable selection gene for use in yeast is the 
trpl gene present in the yeast plasmid Yrp7 [Stinchcomb, et 
al. f Nature 282(5734): 39-43 (1979); Kingsman, et al . , Gene 
7(2): 141-52 (1979); Tschumper, et al . , Gene 10(2): 157-66 
(1980)]. The trpl gene provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in 



WO 03/076567 PCT7TJS03/03120 

-38- 

tryptophan, for example, ATCC No. 44076 or PEPC1 [Jones, 
Genetics 85: 23-33 (1977)]. 

Expression and cloning vectors usually contain a promoter 

* 

operably linked to the fusion protein- encoding nucleic acid 
sequence to direct mRNA synthesis. Promoters recognized by a 
variety of potential host cells are well known. Promoters 
suitable for use with prokaryotic hosts include the |3- 
lactamase and lactose promoter systems [Chang, et al . , Nature 
275 (5681): 617-24 (1978); Goeddel, et al . , Nature 281 (5732) : 
544-8 (1979)], alkaline phosphatase, a tryptophan (up) 
promoter system [Goeddel, Nucleic Acids Res. 8(18): 4057-74 
(1980); EP 36,776 published 30 September 1981], and hybrid 
promoters such as the tat promoter [deBoer, et al., Proc. 
Natl. Acad. Sci . USA 80(1): 21-5 (1983)]. Promoters for use 
in bacterial systems also will contain a Shine-Dalgarno (S.D.) 
sequence operably linked to the DNA encoding the fusion 
protein. 

Transcription of a polynucleotide encoding a fusion 
protein by higher eukaryotes may be increased by inserting an 
enhancer sequence into the vector. Enhancers are cis -acting 
elements of DNA, usually about from 10 to 300 bp, that act on 
a promoter to increase its transcription. Many enhancer 
sequences are now known from mammalian genes (globin, 
elastase, albumin, a-ketoprotein, and insulin) . Typically, 
however, one will use an enhancer from a eukaryotic cell 
virus. Examples include the SV40 enhancer on the late side of 
the replication origin (bp 100-270), the cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of 
the replication origin, and adenovirus enhancers. The 
. enhancer may be spliced into the vector at a position 5' or 3' 
to the fusion protein coding sequence but is preferably 
located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, 
fungi, insect, plant, animal, human, or nucleated cells from 
other multicellular organisms) will also contain sequences 
necessary for the termination of transcription and for 
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stabilizing the mRNA. Such sequences are commonly available 
from the 5' and occasionally 3' untranslated regions of 
eukaryotic or viral DNAs or cDNAs. These regions contain 
nucleotide segments transcribed as polyadenylated fragments in 
the untranslated portion of the mRNA encoding the fusion 
protein. 

Various forms of a fusion protein may be recovered from 
culture medium or from host cell lysates. If membrane -bound, 
it can be- released- from the membrane using a suitable 
detergent solution (e.g., Triton-X 100) or by enzymatic 
cleavage. Cells employed in expression of a fusion protein 
can be disrupted by various physical or chemical means, such 
as freeze-thaw cycling, sonication, mechanical disruption, or 
cell lysing agents. 

Purification of the heterologous fusion proteins of the 
present invention : 

Once the heterologous fusion proteins of the present 
invention are expressed in the appropriate host cell, the 
analogs can be isolated and purified. The following 
procedures are exemplary of suitable purification procedures: 

Various methods of protein purification may be employed 
and such methods are known in the art and described, for 
example, in Deutscher, Methods in Enzymology 182: 83-9 (1990) 
and Scopes, Protein Purification: Principles and Practice, 
Springer-Verlag, NY (1982). The purification step(s) selected 
will depend on the nature of the production process used and 
the particular fusion protein produced. For example, fusion 
proteins comprising an Fc fragment can be effectively purified 
using a Protein A or Protein G affinity matix. Low or high pH 
buffers can be used to elute the fusion protein from the 
affinity matrix. Mild elution conditions will aid in 
preventing irreversible denaturation. of the fusion protein. 
Imidazole-containing buffers can also be used. Example 3 
describes some successful purification protocols for the 

* 

fusion proteins of the present invention. 
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Characterization of the heterologous fusion proteins of the ■ 

present invention : 

Numerous methods exist to characterize the fusion 
proteins of the present invention. Some of these methods 
include: SDS-PAGE coupled with protein staining methods or 
immunoblotting using anti-IgG, anti-HA and anti-G-CSF 
antibodies. Other methods include matrix assisted laser 
desporption/ionization-mass spectrometry (MALDI-MS) , liquid 
chromatography/mass spectrometry, isoelectric focusing, 
analytical anion exchange, chrornatof ocussing, and circular 
dichroism to name a few. A representative number of 
heterologous fusion proteins were characterized using SDS-PAGE 
coupled with immunoblotting as well as mass spectrometry 

For example, Table 2 illustrates the calculated molecular 
mass for a representative number of fusion proteins as well as 
the observed mass (as measured by protease mapping/LC-MS) . 
The relative differences between observed mass and mass 
calculated for a nonglycosylated protein are indicative of the 

extent of glycosylation. 

The heterologous fusion proteins of the present invention 
may be formulated with one or more excipients. The active 
fusion proteins of the present invention may be combined with 
a pharmaceutical^ acceptable buffer, and the pH adjusted to 
provide acceptable stability, and a pH acceptable for 
adminstration such as parenteral administration. 

Optionally, one or more pharmaceutically-acceptable anti- 
microbial agents may be added. Meta-cresol and phenol are 
preferred pharmaceutically-acceptable microbial agents. One 
or more pharmaceutically-acceptable salts may be added to 
adjust the ionic strength or tonicity. One or more excipients 
may be added to adjust the isotonicity of the formulation. 
Glycerin is an example of an isotonicity-adjusting excipient. 
Pharmaceutical^ acceptable means suitable for adminstration 
to a human or other animal and thus, does not contain toxic 
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elements or undersirable contaminants and does not interfere 
with the activity of the active compounds therein. 

A pharmaceutically-acceptable salt form of the 
heterologous fusion proteins of the present invention may be 
used in the present invention. Acids commonly employed to 
form acid addition salts are inorganic acids such as 
hydrochloric acid, hydrobromic acid, hydriodic acid, sulfuric 
acid, phosphoric acid, and the like, and organic acids such as 
p-toluenesulf onic acid, methanesulf onic acid, oxalic acid, p- 
bromophenyl- sulfonic acid, carbonic acid, succinic acid, 
citric acid, benzoic acid, acetic acid, and the like. 
Preferred acid addition salts are those formed with mineral 
acids such as hydrochloric acid and hydrobromic acid. 

Base addition salts include those derived from inorganic 
bases, such as ammonium or alkali or alkaline earth metal 
hydroxides, carbonates, bicarbonates, and the like. Such 
bases useful in preparing the salts of this invention thus 
include sodium hydroxide, potassium hydroxide, ammonium 
hydroxide, potassium carbonate, and the like. 

Admins t rat ion of Compositions; 

Administration may be via any route known to be effective 
by the physician of ordinary skill. Peripheral, parenteral is 
one such method. Parenteral administration is commonly 
understood in the medical literature as the injection of a 
dosage form into the body by a sterile synringe or some other 
mechanical device such as an infusion pump. Peripheral 
parenteral routes can include intravenous, intramuscular, 
subcutaneous, and intraperitoneal routes of administration. 

The heterologous fusion proteins of the present invention 
may also be amenable to adminstration by oral, rectal, nasal, 
or lower respiratory routes, which are non-parenteral routes. 
Of these non-parenteral routes, the lower respiratory route 
and the oral route are preferred. 

The heterologous fusion proteins of the present 
invention can be used to treat patients with insufficient 
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circulating neutrophil levels, typically those undergoing 

cancer chemotherapy. 

An "effective amount" of the heterologous fusion 
protein is the quantity which results in a desired 
therapeutic and/or prophylactic effect without causing 
unacceptable side-effects when administered to a subject in 
need of G-CSF receptor stimulation. A "desired therapeutic 
effect" includes one or more of the following: 1) an 
amelioration of the symptom (s) associated with the disease 
or condition; 2) a delay in the onset of symptoms associated 
with the disease or condition; 3) increased longevity 
compared with the absence of the treatment; and 4) greater 
quality of life compared with the absence of the treatment. 

The present invention comprises G-CSF compounds that 
have improved biochemical and biophysical properties by 
virtue of being fused to an albumin protein, an albumin 
fragment, an albumin analog, a Fc protein, a Fc fragment, or 
a Fc analog. These heterologous proteins can be 
successfully expressed in host cells, retain signaling 
activities associated with activation of the G-CSF receptor, 
and have prolonged half -lives. 

The following examples are presented to further 
describe the present invention. The scope of the present 
invention is not to be construed as merely consisting of the 
following examples. Those skilled in the art will recognize 
that the particular reagents, equipment, and procedures 
described are merely illustrative and are not intended to 
limit the present invention in any manner. 

EXAMPLES 

Example 1: Construction of DNA encoding glycosylated G-CSF 

analogs : 

Table 1 provides the sequence of primers used to create 
functional glycosylation sites in different regions of the 
protein (See Figure 1) . 



« 
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Table 1: Primer sequences used to introduce mutations into 
human G-CSF. 



Mutation 


A Primer* 


B Primer* 


C Primer* 


D Primer* 


* 

WT 


CF177 [SEQ ID 


CF178[SEQ ID 


CF179 [SEQ ID 


CF176 [SEQ ID 




NO : 2 5 ] 


NO:26l 


NO- 27] 


NO:28l 




GTAA GCTTGCGT 


GGGGCAGGGAGC 


GGACAGTGCAGG 


GAA CCTCGA GGA 




CGA CGCTAGCGG 


TGGCTGGGCCCA 


AAGCCACTCCAC 


TCCT C ATT AGGG 




CGCGCCGCCATG 


GTGGAGTGGCTT 


TGGGCCCAGCCA 


CTGGGCAAGGTG 




GCCGGACCTGCC 

\J w ^wwr^u x w w w 


CCTGCACTGTrr 


GCTCCCTGCCCC 

\J v*. X W V^. W* X \^ Vi_* \v 


CCTTAAGACGCG 




ACCCAGAGCCCC 


AGAGTGCACTGT 


AGAGCTTCCTG 


/^i rn Tv no 7v 0 7v /^l/^irn/^i 

GTACGACACCTC 




ATGAAGCTG 


G 




CAGGAAGCTCTG 


w X i r\ 


CF1 77 TSEO ID 


C17Arev TSEO 


G17Af or T SEO 


CF176 TSEO ID 


OCt W J- 


NO • ? 9 1 


ID NO -5 01 


ID NO -311 

X JmJ ±H V-/ • J -X J 


NO -321 




GTAA GCTTGCGT 


GCTCTAAGGCPT 


GGGGCCAGCGAG 


GAA CCTCGA GGA 




CGA CGCTAGCGG 


TG AG C AG G AAG C 


CTCCCTGCCCCA 

W X W *w W X VJVb« W W W^li 


TCCTCATTAGGG 

X W W X W^^ X X + ±\J\JsJ 




CGCGCCGCCATG 


TCTGGGGGAGGG 


GAGPTTGCTGCT 

VJilVJV^ X X WW X WW x 


CTGGGCAAGGTG 

w x vvwwinwy x \j 




GCCGGACCTGCC 


AGCTCGCTGGGr 

nu >w X WWW X WwVJV^ 


GAAGGCCTTAGA 


CCTTAAGACGCG 




AuLCAGAtjCLUU 


LLAb TGGAG 


GCAAG 






ATGAAGCTG 






CAGGAAGCTCTG 


A37N, Y39T 


CF177 [SEQ ID 


A37Nrev [SEQ 


A3 7Nf or [SEQ 


CF176 [SEQ ID 


5De J 

* — r *-»•* «!• 


NO: 33] 


ID NO:34] 


ID NO: 3 5] 


NO:36] 




GTAAGCTTGCGT 


GTCCGAGCAGCA 


GGCGCAGCGCTC 


GAACCTCGAGGA 




CGACGCTAGCGG 


CTAGTTCCTCGG 


CAGG AGAAG CTG 


TCCTCATTAGGG 




CGCGCCGCCATG 


GGTGGCACAGCT 


TGTAACACCACC 


CTGGG C7VAGGTG 




GCCGGACCTGCC 


TGGTGGTGTTAC 


AAG C TG TG C CAC 


CCTTAAGACGCG 




ACCCAGAGCCCC 


ACAGCTTCTCCT 


CCCGAGGAACTA 


GTACGACACCTC 




ATGAAGCTG 


G 


GTGCTG 


CAGGAAGCTCTG 


T133N, 


CF177 [SEQ ID 


T133Nrev[SEQ 


T133Nf or [SEQ 


CF176 [SEQ ID 


G135T 


NO:37] 


ID NO:38] 


ID NO:39] 


N0:40] 


Eco4 7 III 


GTAAGCTTGCGT 


GCCCGGCGCTGG 


GGCCCCTGCCCT 


GAA CCTCGA GGA 




CGACGCTAGCGG 


AAAGCGCTGGCG 


GCAGCCCAACCA. 


TCCTCATTAGGG 

* 




CGCGCCGCCATG 


AAGGCCGGCATG 


GACCGCCATGCC 


CTGGGCAAGGTG 




GCCGGACCTGCC 


GCGGTCTGGTTG 


GGCCTTCGCCAG 


CCTTAAGACGCG 




ACCCAGAGCCCC 


GGCTGCAGGGCA 


CGCTTTCCAGCG 


GTACGACACCTC 




ATGAAGCTG 


G 




CAGGAAGCTCTG 
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A141N, 


CF177 [SEQ ID 


A141Nrev[SEQ 


A141Nf or [SEQ 


CF176 [SEQ ID 


A143T 


NO:41] 


ID NO:42] 


ID NO:43] 


NO:44] 


Sap I 


GTAAGCTTGCGT 


GCCCGGCGCTGG 


GGGAATGGCCCC 


GAACCTCGAGGA 




CGACGCTAGCGG 


AAGGTAGAGTTG 


TGCTCTTCAGCC 


TCCTCATTAGGG 

* 




CGCGCCGCCATG 


AAGGCCGGCATG 


CACCCAGGGTGC 


CTGGGCAAGGTG 




GCCGGACCTGCC 


GCACCCTGGGTG 


CATGCCGGCCTT 


CCTTAAGACGCG 




ACCCAGAGCCCC 


GGCTGAAGAGCA 


CAACTCTACCTT 


GTACGACACCTC 




ATGAAGCTG 


GGGGCCAT 


CCAGCGCCGGGC 


CAGGAAGCTCTG 








AG 





P57V, 
W58N, P60T 

Hpal 



JCB128 [SEQ 
ID NO:45] 
GCTAGCGGCGCG 
CCACCATG 



Q67N,L69T 

Nael 



P60N,S62T 




JCB134 [SEQ 

ID NO:49] 
GCTAGCGGCGCG 
CCACCATGGCCG 
GACCTGCCACCC 
AG 

JCB128 [SEQ 
ID NO:53] 
GCTAGCGGCGCG 
CCACCATG 



JCB136 [SEQ 
ID NO:46] 
GCTCAGGGTAGC 
GTTAACGATGCC 
CAGAGAGTG 



JCB138 [SEQ 
ID NO: 50] 
CAAGCAGCCGGC 
CAGCTGGGTGGC 
GTTGCTGGGGCA 
GCTGCTCAG 
JCB130 [SEQ 
ID NO: 54] 
GGGGCAACTAGT 
CAGGTTAGCCCA 
GGG 



JCB137 [SEQ 
ID NO:47] 
GGGCATCGTTAA 
CGCTACCCTGAG 
CAGCTG 



JCB139 [SEQ 

ID NO:51] 
GCCCCAGCAACG 
CCACCCAGCTGG 
CCGGCTGCTTGA 



JCB131 [SEQ 
ID NO:55] 
GCTAACCTGACT 
AGTTGCCCCAGC 
CAG 



JCB129 [SEQ 

ID NO:48] 

GA CTCGAGGATC 

CTCATTAGGGCT 

GGG 



JCB135 [SEQ 

ID NO: 52] 

GA CTCGAGGATC 

CTCATTAGGGCT 

GGGCAAGGTGCC 

TTAAGACGCGG 

JCB129 [SEQ 

ID NO:56] 

GA CTCGAGGATC 

CTCATTAGGGCT 

GGG 



S63N, P65T 
Mfel 



E93N, I95T 
BspEI 



JCB12 8 [SEQ 
ID NO: 57] 
GCTAGCGGCGCG 
CCACCATG 



JCB132 [SEQ 
ID NO:58] 
GGTGCAATTGCT 
CAGGGGAGCCCA 



JCB133 [SEQ 

ID NO: 59] 
GCAATTGCACCA 

GCCAGGCCCTG 



JCB134 [SEQ 
ID NO: 61] 
GCTAGCGGCGCG 
CCACCATGGCCG 
GACCTGCCACCC 



JCB14 0 [SEQ 
ID NO: 62] 
CCGGACTGGTCC 
CGTTCAGGGCCT 
GCAGGAGCCCCT 



JCB141 [SEQ 
ID NO:63] 
GAACGGGACCAG 
TCCGGAGTTGGG 
TCCCACCTTGG 



JCB129 [SEQ 
ID NO:60] 
GA CTCGAGGATC 
CTCATTAGGGCT 

GGG 

JCB135 [SEQ 

ID NO:64] 

GA CTCGAGGATC 

CTCATTAGGGCT 

GGGCAAGGTGCC 
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AG 


G 




TTAAGACGCGG 




Sail 


JCB155 [SEQ 
ID NO: 65] 
GTCGACGCTAGC 
GGCGCGCCACCA 
TGGCCGGACCTG 


4 







^Nucleotides in bold represent changes imposed in the target 
sequence and nucleotides in bold and italics represent 
flanking sequences which may add restriction sites to 
facilitate cloning, Kozac sequences, pr stop codons . 



Preparation la: DNA encoding wild-type human G-CSF 

A strand overlapping extension PCR reaction was used to 
create a wild type human G-CSF construct in order to 
eliminate the methylation of an Apal site. Isolated human 
G-CSF cDNA served as the template for these reactions. The 
5' end A primer was used to create a restriction enzyme site 
prior to the start of the coding region as well as to 
introduce a Kozac sequence (GGCGCC) 5' of the coding leader 
sequence to faciliate translation in cell culture . 

The A-B product was generated using primers CF177 and 
CF178 in a PCR reaction. Likewise, the C-D product was 
produced with primers CF179 and CF176. The products were 
isolated and combined. The combined mixture was then used 
as a template with primers CF177 and CF178 to create the 
full-length wild-type construct. [Nelson, R.M. and Long, 
G.C. (1989), Anal. Biochem. 180:147-151]. 

The full-length product was ligated into the pCR2.1- 
Topo vector (Invitrogen, Inc. Cat. No. K4500-40) by way of a 
topoisomerase TA overhang system to create pCR2.1G-CSF. 

The following protocol was used for preparation of the 
full-length wild-type G-CSF protein as well as each of the 
G-CSF analogs. Approximately 5 ng of template DNA and 15 
pmol of each primer was used in the initial PCR reactions. 
The reactions were prepared using Platinum PCR Supermix® 
(GibcoBRL Cat. No. 11306-016). The PCR reactions were 
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denatured at 94°C for 5 min and then subject to 25 cycles 
wherein each cycle consisted of 30 seconds at 94°C followed 
by 30 seconds at 60°C followed by 30 seconds at 72°C. A 
final extension was carried out for 7 minutes at 72°C. PGR 
fragments were isolated from agarose gels and purified using 
a Qiaquick® gel extraction kit (Qiagen, Cat. No. #28706). 
DNA was resuspended in sterile water and used for the final 
PCR reaction to prepare full-length product . 

Preparation lb: DNA encoding G-CSF [A37N, Y39T, 
P57V,W58N,P60T,Q67N,L69T] was constructed as follows: 

DNA encoding G-CSF [A37N, Y3 9T, Q67N, L6 9T] was subcloned 
into pJB02 to create pJB02G-CSF [A37N, Y39T,Q67N,L69T] and 
pJB02G-CSF [A37N, Y39T, P57V, W58N, P60T] served as the template 
for strand overlapping expression PCR. JCB155 and JCB136 
served as the A and B primers and JCB137 and JCB135 served 
as the C and D primers. The full-length mutated cDNA was 
prepared as described previously using JCB155 and JCB134 
primers. The resulting full-length DNA encodes a protein 
with consensus N-linked glycosylation sites in region 1, 
region 2, and region 9 of the protein (See Figure 1). The 
full-length cDNA was ligated back into pCR2.1-Topo to create 
pCR2 . 1G-CSF [A37N, Y39T, P57V, W58N, P60T, 
Q67N,L69T] . 

Preparation lc: DNA encoding G-CSF [A37N, Y39T, 
S63N, P64T, E93N, I95T] was constructed as follows: 

DNA encoding G-CSF [A37N, Y3 9T, E93N, I95T] was subcloned 
into pJB02 to create pJB02G-CSF [A37N, Y39T, E93N, I95T] and 
pJB02G-CSF [A37N, Y39T,E93N, I95T] served as the template for 
strand overlapping expression PCR. JCB155 and JCB132 served 
as the A and B primers and JCB133 and JCB13 5 served as the C 
and D primers. The full-length mutated cDNA was prepared as 
described previously using JCB155 and JCB135 primers. The, 
resulting full-length DNA encodes a protein with consensus 
N-linked glycosylation sites in region 1, region .7, and 
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region 10 of the protein (See Figure 1) . The full-length 

cDNA was ligated back into pCR2.1-Topo to create pCR2.1G- 
CSF [A37N, Y39T, S63N, P64T, E93N, I95T] . 

Preparation Id: DNA encoding G-CSF[C17A] which is G-CSF 
wherein the amino acid at position 17 is substituted with 
Ala is constructed as follows: 

The wild- type construct in the pCR2.1~Topo vector 
(pCR2 .1G-CSF) serves as the PCR template for the C17A 
mutatgenesis . Strand ovelapping extension PCR is performed 
as described previously. CF177 and C17Arev serve as the A-B 
primers and C17Afor and CF176 serve as the C-D primers. The 
full-length mutated cDNA is prepared as described previously 
using the CF177 and CF176 primers. The B and C primers are 
used to mutate the DNA such that a SacI restriction site is 
created and the protein expressed from the full-length 
sequence contains an Alanine instead of a Cysteine at 
position 17. The full-length cDNA is ligated back into the 
pCR2.1-Topo vector to create pCR2 . 1G-CSF [C17A] wherein the 
sequence is confirmed. G-CSF analog encoding DNA is then 
cloned into the Nhe/Xho sites of mammalian expression vector 
pJB02 to create pJB02G-CSF [C17A] . 

Preparation le: DNA encoding G-CSF [A37N, Y3 9T] is 
constructed as follows: 

Strand overlapping extension PCR is performed using 
pCR2.1G-CSF[C17A] as the template. Primers CF177 and 
A3 7Nrev serve as the A-B primers and CF176 and A3 7Nfor serve 
as the C-D primers. The full-length mutated cDNA is prepared 
as described previously using the CF177 and CF176 primers. 
The B and C primers contain mismatched sequences such that a 
Spel site is created in the DNA and the protein expressed 
from the ful-l-length sequence contains a consensus sequence 
for N-linked glycosylation in region 1 of the protein. The 
full-length cDNA is ligated back into the pCR2.1-Topo vector 
to create pCR2 . 1G-CSF [A37N, Y39T] wherein the sequence is 
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confirmed. G-CSF analog encoding DNA is then cloned into the 
Nhe/Xho sites of mammalian expression vector pJB02 to create 

pJB02G-CSF [A37N, Y39T] . 

Preparation If: DNA encoding G-CSF [P57V, VJ5 8N, P60T] is 

constructed as follows: 

Strand overlapping extension PCR is performed using 
pJB02G-CSF [C17A] as the template. Primers JCB128 and JCB136 
serve as the A-B primers and JCB137 and JCB129 serve as the 
C-D primers. The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contain mismatched sequences such that a 
Hpal site is created and the protein expressed from the 
full-length sequence contains a consensus sequence for N- 
linked glycosylation in region 2 of the protein. The. full- 
length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2 . 1G-CSF [P57V, W58N, P60T] wherein the sequence is 
confirmed. G-CSF analog encoding DNA is then cloned into the 
Nhe/Xho sites of mammalian expression vector pJB02 to create 

pJB02G-CSF[P57V,W58N,P60T] . 

Preparation lg: DNA encoding G-CSF [P60N, S62T] is 
constructed as follows: 

Strand overlapping extension PCR is performed using 
pJB02G-CSP [C17A] as the template. Primers JCB128 and JCB130 
serve as the A-B primers and JCB131 and JCB129 serve as the 
C-D primers. The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contain mismatched sequences such that a 
Spel site is created and the protein expressed from the 
full-length sequence contains a consensus sequence for N- 
linked glycosylation in region 4 of the protein. The full- 
length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2 . 1G-CSF [P60N, S62T] wherein the sequence is 
confirmed. G-CSF analog encoding DNA is then cloned into the 
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Nhe/Xho sites of mammalian expression vector pJB02 to create 

pJB02G-CSF[P60N,S62T] . 

Preparation lh: DNA encoding G-CSF [S63N, P65T] is 
constructed as follows: 

Strand overlapping extension PCR is performed using 
pJB02G-CSF [C17A] as the template. Primers JCB12 8 and JCB132 
serve as the A-B primers and JCB133 and JCB129 serve as the 
C-D primers. The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contain mismatched sequences such that 
a Mfel site is created and the protein expressed from the 
full-length sequence contains a consensus sequence for N- 
linked glycosylation in region 7 of the protein. The full- 
length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2 . 1G-CSF [S63N, P65T] wherein the sequence is 
confirmed. G-CSF analog encoding DNA is then cloned into the 
Nhe/Xho sites of mammalian expression vector pJB02 to create 
pJB02G-CSF[S63N, P65T] . 

Preparation li: DNA encoding G-CSF [Q67N, L69T] is 
constructed as follows: 

Strand overlapping extension PCR is performed using 
pJB02G-CSF[C17A] as the template. Primers JCB134 and JCB138 
serve as the A-B primers and JCB139 and JCB135 serve as the 
C-D primers. The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contain mismatched sequences such that 
a Nael site is created and the protein expressed from the 
full-length sequence contains a consensus sequence for N- 
linked glycosylation in region 9 of the protein. The full- 
length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2 . 1G-CSF [Q67N # L69T] wherein the sequence is 
confirmed. G-CSF analog encoding DNA is then cloned into the 
Nhe/Xho sites of mammalian expression vector pJB02 to create 

pJB02G-CSF[Q67N,L69T] . 
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Preparation lj : DNA encoding G-CSF [E93N, I95T] is 
constructed as follows: 

Strand overlapping extension PCR is performed using 
pJB02G-CSF [C17A] as the template. Primers JCB134 and JCB140 
serve as the A-B primers and JCB141 and JCB135 serve as the 
C-D primers. The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contain mismatched sequences such that 
a BspEI site is created and the protein expressed' from the 
full-length sequence contains a consensus sequence for N- 
linked glycosylat ion in region 10 of the protein. The full- 
length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2 . 1G-CSF [E93N, I95T] wherein the sequence is 
confirmed. G-CSF analog encoding DNA is then' cloned into the 
Nhe/Xho sites of mammalian expression vector pJB02 to create 
pJB02G-CSF [E93T, I95T] . 

Preparation Ik: DNA encoding G-CSF [T133N, G135T] is 
constructed as follows: 

Strand overlapping extension PCR is performed using 
pCR2 . 1G-CSF [C17A] as the template. Primers CF177 and 
T133Nrev serve as the A-B primers and T133Nfor and CF176 
serve as the C-D primers. The full-length mutated cDNA is 
prepared as described previously using the CF177 and CF176 
primers. The B and C primers contain mismatched sequences 
such that an Eco47III site is created and the protein 
expressed from the full-length sequence contains a consensus 
sequence for N-linked glycosylation in region 13 of the 
protein. The full-length cDNA is ligated back into the 
pCR2.1-Topo vector to create pCR2 . 1G-CSF [T133N, G135T] 
wherein the sequence is confirmed. 

Preparation 11: DNA encoding G-CSF [A141N, A143T] is 
constructed as follows: 
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Strand overlapping extension PCR is performed using 
pCR2 .1G-CSP [C17A] as the template. Primers CF177 and 
A141Nrev serve as the A-B primers and A141Nfor and CF176 
serve as the C-D primers. The full-length mutated cDNA is 
prepared as described previously using the CF177 and CF176 
primers. The B and C primers contain mismatched sequences 
such that an Sapl site is created and the protein expressed 
from the full-length sequence contains a consensus sequence 
for N-linked glycosylation in region 14 of the protein. The 
full-length cDNA is ligated back into the pCR2.1-Topo vector 
to create pCR2 . 1G-CSF [A141N, A143T] wherein the sequence is 
confirmed. 

Preparation lm: DNA encoding G-CSF[A37N, Y39T,T133N, 
G135T] is constructed as follows: 

A 210 bp insert containing G-CSF [A37N, Y39T] is isolated 
from pCR2 . 1G-CSF [A37N, Y39T] using EcoNI. This fragment is 
ligated into pCR2 . 1G-CSF [T133N, G135T] which is prepared by 
cleavage with EcoNI and subsequent isolation of the vector 
(4359 bp) from a 210 bp fragment containing wild-type G-CSF 
sequences. This ligation creates pCR2.1G- 
CSF[A37N,Y39T,T133N,G135T] . Analog encoding DNA is then 
subcloned into pJB02 using Nhel/Xhol to create pJB02G- 
CSF[A37N,Y39T,T133N,G135T] . 

Preparation In: DNA encoding G-CSF [A3 7N, Y39T, A141N, 
A143T] is constructed as follows: 

A 210 bp insert containing G-CSF [A37N, Y39T] is isolated 
from pCR2.1G-CSF[A37N / Y39T] using EcoNI, This fragment is 
ligated into pCR2 . 1G-CSF [A141N, A143T] which is prepared by 
cleavage with EcoNI and subsequent isolation of the vector 
(4359 bp) from a 210 bp fragment containing wild-type G-CSF 
sequences. This ligation creates pCR2.1G- 
CSF[A37N / Y39T,A141N / A143T] . Analog encoding DNA is then 
subcloned into pJB02 (Figure 3) using Nhel/Xhol to create 

pJB02G-CSF[A37N,Y3 9T,A141N,A143T] . 
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Preparation lo: DNA encoding G-CSF [A37N, Y39T, P57V, W58N, 
P60T] is constructed as follows: 

DNA encoding G-CSF [A37N, Y39T] is subcloned into pJB02 
to create pJB02G-CSF [A37N, Y39T] and pJB02G-CSF [A37N, Y39T] 
serves as the template for strand overlapping expression 
PCR. JCB12 8 and JCB13 6 serve as the A and B primers and 
JCB137 and JCB129 serve as the C and D primers. The full- 
length mutated cDNA is prepared as described previously 
using JCB128 and JCB129 primers. The resulting full-length 
DNA encodes a protein with consensus N-linked glycosylation 
sites in region 1 and region 2 of the protein. The full- 
length cDNA is ligated back into pCR2.1-Topo to create 
pCR2.1G-CSF[A37N,Y39T,P57V,W58N,P60T] . 

Preparation lp: DNA encoding G-CSF [A37N / Y3 9T, Q67N, L69T] 
is constructed as follows: 

DNA encoding G-CSF [A37N / Y39T] is subcloned into pJB02 
to create pJB02G-CSF [A37N, Y39T] and pJB02G-CSF [A37N, Y39T] 
serves as the template for strand overlapping expression 
PCR. JCB134 and JCB13 8 serve as the A and B primers and 
JCB139 and JCB13 5 serve as the C and D primers. The full- 
length mutated cDNA is prepared as described previously 
using JCB128 and JCB129 primers. The resulting full-length 
DNA encodes a protein with consensus N-linked glycosylation 
sites in region 1 and region 9 of the protein. ...The full- 
length cDNA is ligated back into pCR2.1-Topo to create 
pCR2.1G-CSF[A37N,Y39T,Q67N,L69T] . 

Preparation lq: DNA encoding G-CSF [A3 7N, Y3 9T, E93N, I95T] 
is constructed as follows: 

DNA encoding G-CSF [A3 7N, Y39T] is subcloned into pJB02 to 
create pJB02G-CSF [A37N, Y39T] and pJB02G-CSF [A37N, Y39T] 

serves as the template for strand overlapping expression 

* 

PCR. JCB134 and JCB140 serve as the A and B primers and 
JCB141 and JCB135 serve as the C and D primers.' The full- 
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length mutated cDNA is prepared as described previously 
using JCB128 and JCB129 primers. The resulting full-length 
DNA encodes a protein with consensus N-linked glycosylation 
sites in region 1 and region 10 of the protein. The full- 
length cDNA is ligated back into pCR2.1-Topo to create 

pCR2 . 1G-CSF [A37N, Y39T, E93N, I95T] . 

Example 2: Expression of heterologus fusion proteins: 

2a: Expression in 293/EBNA cells: 

Each full-length DNA encoding a G-CSF analog was 
subcloned into the Nhel/Xhol sites of mammalian expression 
vector pJB02 (Figure 3) . This vector contains both the Ori 
P and Epstein Barr virus nuclear antigen (EBNA) components 
which are necessary for sustained, transient expression in 
293 EBNA cells. This expression plasmid contains a 
puromycin resistance gene expressed from the CMV promoter as 
well as an ampicillin resistance gene. The gene of interest 
is also expressed from the CMV promoter. 

The transfection mixture was prepared by mixing 73 \il 
of the liposome transfection agent Fugene 6® (Roche 
Molecular Biochemicals , Cat. No. 1815-075) with 820 |ll Opti- 
Mem® (GibcoBRL Cat. No. 31985-062) . G-CSF pJB02 DNA 
(12jig) , prepared using a Qiagen plasmid maxiprep kit 
(Qiagen, Cat. No. 12163), was then added to the mixture. 
The mixture was incubated at room temperature for 15 
minutes . 

Cells were plated on 10 cm 2 plates in DMEM/F12 3:1 
(GibcoBRL Cat. No. 93-0152DK) supplemented with 5% fetal 
bovine serum, 20mM HEPES, 2 mM L-glutamine, and 50> jig/mL 
Geneticin such that the plates were 60% to 80% confluent by 
the time of the transfection. Immediately before the 
transfection mixture was added to the plates, fresh media 
was added. The mixture was then added dropwise to cells 
with intermittent swirling. Plates were then incubated at 
37°C in a 5% C0 2 atmosphere for 24 hours at which point the 
media was changed to Hybritech medium without serum. The 
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media containing a secreted form of a glycosylated G-CSF 
analog was then isolated 48 hours later. 

• * 

2b: Expression in CHO cells: 

The expression vector for expression in CH0-K1 cells pEE14 . 1 
is illustrated in Figure 4. This vector includes the 
glutamine synthetase gene which enables selection using 
methionine sulfoximine. This gene includes two poly A 
signals at the 3' end. G-CSF analogs are expressed from the 
CMV promoter which includes 5' untranslated sequences from 
the hCMV-MIE gene to enhance mRNA levels and 

translatability . The SV40 poly A signal is cloned 3' of the 
G-CSF analog DNA. The SV40 late promoter drives expression 
of GS minigene. This expression vector encoding the gene 
of interest was prepared for transfection using a QIAGEN 
Maxi Prep Kit (QIAGEN, Cat. No. 123 62) . The final DNA 
pellet (50-100 |ig) was resuspended in 100 (il of basal 
formulation medium (GibcoBRL CD-CHO Medium without L- 
Glutamine, without thymidine, without hypoxanthine) . 
Before each transfection, CHO-K1 cells were counted and 
checked for viability. A volume equal to 1 x 10 7 cells was 
centrifuged and the cell pellet rinsed with basal 
formulation medium. The cells were centrifuged a second 
time and the final pellet resuspended in basal formulation 
medium (700 \il final volume) . 

The resuspended DNA and cells were then mixed together 
in a standard electroporation cuvette (Gene Pulsar Cuvette) 
used to support mammalian transf ections , and placed on ice 
for five minutes. The cell/DNA mix was then electroporated 
in a BioRad Gene Pulsar device set at 300V/975 JJ.F and the 
cuvette placed back on ice for five minutes. The cell/DNA 
mixed was then diluted into 20 ml of cell growth medium in a 
non-tissue culture treated T75 flask and incubated at 37°C / 
5% C0 2 for 48-72 hours. 

The cells were counted, checked for viability, and 
plated at various cell densities in selective medium in 96 
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well tissue culture plates and incubated at 37°C in a 5% C0 2 
atmosphere. Selective medium is basal medium with IX HT 
Supplement (GibcoBRL 100X HT Stock) , 100 jig/mL Dextran 
Sulfate (Sigma 100 mg/ml stock) , IX GS Supplements (JRH 
Biosciences SOX Stock) and 25 |iM MSX (Methionine 
Sulphoximine) . The plates were monitored for colony 
formation and screened for glycosylated G-CSF analog 
production. 

Example 3: Purification of Heterologous Fusion Proteins 
HA Fusions 

The cell culture harvest was dialyzed against 20 mM 
Tris pH 7.4. An anion exchange column (1 ml Pharmacia 
HiTrap Q) was equilibrated with 20 mM Tris pH7 . 4 and the 
dialyzed material loaded at 2 ml/min. The protein was 
eluted from the column using a linear gradient from 0 to 500 
mM NaCl in 80 min at 1 ml/min and elution was monitored by 
UV absorbance at 280 mm. SDS-PAGE analysis was used to 
identify and pool fractions of interest. This pool was 
dialyzed against 25 mM sodium acetate (NaOAc) pH 5.0 

A cation exchange column (1 ml Pharmacia HiTrap S 
column) was equilibrated with 25 mM -NaOAc pH 5.0 and the 
dialysate was loaded at 1 ml/min. The protein was eluted 
from the column using a linear gradient from 0 to 500 mM 
NaCl in 30 min. The fractions were immediately neutralized 
with 1 M Tris pH 8 to a final pH of 7 . SDS-PAGE gels were 
used to identify and pool fractions of interest. 

Fc Fusions 

The cell culture harvest was dialyzed against 20 mM 
sodium phosphate pH 7.0. An affinity column (1 ml Pharmacia 
HiTrap Protein A or rProtein A) was equilibrated with 20 mM 
sodium phosphate pH 7.0 and the dialysate was loaded at 2 
ml/min. 1 ml/min of 100 mM citric acid pH 3 was used to 
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elute the protein. Fractions were immediately neutralized 
with 1M Tris pH 8 to pH 7 and peak fractions (determined by 
in-line OD280 monitoring) were further diluted with 20 mM 
sodium phosphate pH 7.0. SDS-PAGE analysis was used to 
identify and pool fractions of interest. 



* 
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WE CLAIM : 

1. A heterologous fusion protein comprising a 
hyperglycosylated G-CSF analog fused to a polypeptide 
selected from the group consisting of 

d) human albumin; 

e) human albumin analogs; and 

f) fragments of human albumin. 



2. The heterologous fusion protein of claim 1, wherein the 
hyperglycosylated G-CSF analog is fused to the polypeptide 
via a peptide linker. 

3. The heterologous fusion protein of the Claim 2 wherein 
the peptide linker is selected from the group consisting of: 

c) a glycine rich peptide; 

d) a peptide having the sequence [Gly-Gly-Gly-Gly-Ser] n 
where nisi, 2, 3, 4, or 5; and 

e) a peptide having the sequence [Gly-Gly-Gly-Gly- 
Ser] 3 . 

4. The heterologous fusion protein of Claims 1, 2, or 3 
wherein the hyperglycosylated G-CSF analog comprises the 



amino < 


acid sequence 


of the formula I 


: [SEQ 


ID : 


NO: 


1] 


l 






5 






10 








15 




Thr Pro 


Leu 


Gly 


Pro 


Ala 


Ser Ser Leu 


Pro 


Gin 


Ser Phe 


Leu 


Leu 


Lys 






20 






25 








30 






Xaa Leu 


Glu 


Gin 


Val 


Arg 


Lys lie Gin 


Gly 


Asp 


Gly Ala Ala 


Leu 


Gin 




35 








40 






45 








Glu Lys 


Leu 


Cys 


Xaa 


Xaa 


Xaa Lys Leu 


Cys 


His 


Pro Glu 


Glu 


Leu 


Val 


50 










55 






60 








Leu Leu Gly 


His 


Ser 


Leu 


Gly He Xaa 


Xaa 


Xaa 


Xaa Xaa 


Xaa 


Xaa 


Xaa 


65 








70 






75 








60 


Xaa Xaa 


Xaa 


Xaa 


Xaa 


Gin 


Leu Ala Gly 


Cys 


Leu 


Ser Gin 


Leu 


His 


Ser 








85 






90 








95 




Gly Leu 


Phe 


Leu 


Tyr 


Gin 


Gly Leu Leu 


Gin 


Ala 


Leu Xaa 


Xaa 


Xaa 


Ser 






100 






105 








110 






Xaa Glu 


Leu 


Gly 


Pro 


Thr 


Leu Asp Thr 


Leu 


Gin 


Leu Asp 


val 


Ala 


Asp 




115 








120 






125 








Phe Ala 


Thr 


Thr 


lie 


Trp 


Gin Gin Met 


Glu 


Glu 


Leu Gly Met 


Ala 


Pro 
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130 135 
Ala Leu Gin Pro Xaa Xaa Xaa Ala 
145 150 
Gin Arg Arg Ala Gly Gly Val Leu 

165 

Leu Glu Val Ser Tyr Arg Val Leu 



-58- 

140 

Met Pro Ala Phe Xaa Xaa Xaa Phe 

155 160 
Val Ala Ser His Leu Aln Ser Phe 

170 

Arg His Leu Ala Gin Pro (I) 



wherein: - 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro ; 

Xaa at position 
Xaa at position 
Pro; 

Xaa at position' 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Pro ; and 
Xaa at position 



17 is Cys, Ala, Leu, Ser, or Glu; 
37 is Ala or Asn; 

3 8 is Thr, or any other amino acid except 

39 is Tyr, Thr, or Ser; 

57 is Pro or Val; 

58 is Trp or Asn; 

59 is Ala or any other amino acid except 

60 is Pro, Thr, Asn, or Ser, 

61 is Leu, or any other amino acid except 

62 is Ser or Thr; 

63 is Ser or Asn; 

64 is Cys or any other amino acid except 

65 is Pro, Ser, or Thr; 

66 is Ser or Thr; 

67 is Gin or Asn; 

6 8 is Ala or any other amino acid except 

69 is Leu, Thr, or Ser 

93 is Glu or Asn 

94 is Gly or any other amino acid except 



95 is lie, Asn, Ser, or Thr; 
97 is Pro, Ser, Thr, or Asn; 
13 3 is Thr or Asn; 

134 is Gin or any other amino acid exc 



135 is Gly, Ser, or Thr 

141 is Ala or Asn; 

142 is Ser or any other amino acid except 

143 is Ala, Ser, or Thr; 




and wherein: 

Xaa at positions 37, 38, and 39 constitute region 1 

Xaa at positions 58, 59, and 60 constitute region 2 

Xaa at positions 59, 60, and 61 constitute region 3 
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Xaa 


at 


positions 


60, 


61, 


and 


62 


constitute 


region 


4; 


Xaa 


at 


positions 


61, 


62, 


and 


63 


constitute 


region 


5; 


Xaa 


at 


positions 


62, 


63, 


and 


64 


constitute 


region 


6; 


Xaa 


at 


positions 


63, 


64, 


and 


65 


constitute 


region 


7; 


Xaa 


at 


positions 


64, 


65, 


and 


66 


constitute 


region 


8; 


Xaa 


at 


positions 


67, 


68, 


and 


69 


constitute 


region 


9; 


Xaa 


at 


positions 


93, 


94, 


and 


95 


constitute 


region 


10; 


Xaa 


at 


positions 


94 , 


95, 


and 


Ser at position 96 





constitute region 11; 
Xaa at positions 95, and 97, and Ser at position 96 

constitute region 12; 
Xaa at positions 133, 134, and 135 constitute 



region 13; 

Xaa at positions 141, 142, and 143 constitute 
region 14 ; 

and provided that at least one of regions 1 through 14 
comprises the sequence Asn Xaal Xaa2 wherein Xaal is any 
amino acid except Pro and Xaa2 is Ser or Thr. 

5. The heterologous fusion protein of Claim 4 wherein any 
two regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 

6. The heterologous fusion protein of Claim 4 wherein any 
three regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 

7. The heterologous fusion protein of Claim 4 wherein any 
four regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 
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8. The heterologous fusion protein of Claim 4 wherein the 
hyperglycosylated G-CSF analog is selected from the group 
consisting of: 



a) 


G- 


CSF[A37N,Y39T] 


b) 


G- 


CSF[P57V,W58N,P60T] 


c) 


G- 


CSF[P60N,S62T] 


d) 


G- 


CSF[S63N,P65T] 


P ) 


G- 


CSF [067N, L69T] 


f ) 


G- 


CSF[E93N / I95T] 


g) 


G- 


CSF[T133N,G135T] 


h) 


G- 


CSF [A141N,A143T] 


1 ) 


G- 


CSF[A3 7N, Y3 9T,P57V,W58N, P6 0T] 


d) 


G- 


CSF [A37N, Y39T, P60N, S62T] 


k) 


G- 


CSF [A3 7N, Y3 9T, S63N, P65T] 


1) 


G- 


CSF [A3 7N, Y39T,Q67N,L69T] 


m) 


G- 


CSF [A3 7N, Y39T, E93N, I95T] 


n) 


G- 


CSF [A3 7N,Y39T,T133N,G135T] 


o) 


G- 


CSF [A3 7N, Y39T,A141N, A143T] 



q 

r 



p) G-CSF [A3 7N,Y3 9T,P57V,W5 8N,P60T,S63N, P65T] 
G-CSF [A3 7N, Y39T, P57V, W58N, P6 0T, Q67N, L6 9T] 
G-CSF [A3 7N,Y3 9T,S63N,P65T,E93N, I95T] . 



9. The heterologous fusion protein of claim 8, wnerein the 
hyperglycosylated G-CSF analog is G-CSF [A37N, 
Y39T, P57V, W58N, P6 0T, Q67N, L69T] . 



10. The heterologous fusion protein of claim 8, wherein the 
hyperglycosylated G-CSF analog is G- 

CSF [A37N, Y39T,S63N,P65T,E93N, I95T] . 



11. A heterologous fusion protein which is the product of 
the expression in a host cell of an exogenous DNA sequence 
which comprises a DNA sequence encoding a heterologous 
fusion protein of any one of Claims 1 through 11. 



12. An isolated nucleic acid sequence, comprising a 
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polynucleotide encoding a heterologous fusion protein of any 
one of Claims 1 through 11. 

13. An isolated nucleic acid sequence, comprising a 
polynucleotide which comprises a DNA sequence selected from 
the group consisting of: 

a) SEQ ID NO: 2 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA CGG TGG ATG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC AAC CAG ACC GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TTG GTC TGG CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

b) SEQ ID NO: 3 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA CGG TGG ATG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
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GGG TCG GTC CGG GAC GTC GAC CGT 

GGC CTT TTC CTC TAC CAG GGG CTC 
CCG GAA AAG GAG ATG GTC CCC GAG 

CCC GAG TTG GGT CCC ACC TTG GAC 
GGG CTC AAC CCA GGG TGG AAC CTG 

TTT GCC ACC ACC ATC TGG CAG CAG 
AAA CGG TGG TGG TAG ACC GTC GTC 

GCC CTG CAG CCC ACC CAG GGT GCC 
CGG GAC GTC GGG TGG GTC CCA CGG 

CAG CGC CGG GCA GGA GGG GTC CTG 
GTC GCG GCC CGT CCT CCC CAG GAC 

CTG GAG GTG TCG TAC CGC GTC TTA 
GAC CTC CAC AGC ATG GCG CAG AAT 

C) SEQ ID NO: 4 

ACC CCC CTG GGC CCT GCC AGC TCC 
TGG GGG GAC CCG GGA CGG TCG AGG 

GCC TTA GAG CAA GTG AGG AAG ATC 
CGG AAT CTC GTT CAC TCC TTC TAG 

GAG AAG CTG TGT AAC ACC ACC AAG 
CTC TTC GAC ACA TTG TGG TGG TTC 

CTG CTC GGA CAC TCT CTG GGC ATC 
GAC GAG CCT GTG ACA GAC CCG TAG 

CCC AGC CAG GCC CTG CAG CTG GCA 
GGG TCG GTC CGG GAC GTC GAC CGT 

GGC CTT TTC CTC TAC CAG GGG CTC 
CCG GAA AAG GAG ATG GTC CCC GAG 

CCC GAG TTG GGT CCC ACC TTG GAC 
GGG CTC AAC CCA GGG TGG AAC CTG 

TTT GCC ACC ACC ATC TGG CAG CAG 
AAA CGG TGG TGG TAG ACC GTC GTC 

GCC CTG CAG CCC ACC CAG GGT GCC 
CGG GAC GTC GGG TGG GTC CCA CGG 

CAG CGC CGG GCA GGA GGG GTC CTG 
GTC GCG GCC CGT CCT CCC CAG GAC 

CTG GAG GTG TCG TAC CGC GTC TTA 
GAC CTC CAC AGC ATG GCG CAG AAT 

d) SEQ ID NO: 5 

ACC CCC CTG GGC CCT GCC AGC TCC 
TGG GGG GAC CCG GGA CGG TCG AGG 



-62- 

CCG ACG AAC TCG GTT GAG GTA TCG 

CTG CAG GCC CTG GAA GGG ATC TCC 
GAC GTC CGG GAC CTT CCC TAG AGG 

ACA CTG CAG CTG GAC GTC GCC GAC 
TGT GAC GTC GAC CTG CAG CGG CTG 

ATG GAA GAA CTG GGA ATG GCC CCT 
TAC CTT CTT GAC CCT TAC CGG GGA 

ATG CCG GCC TTC AAC TCT ACC TTC 
TAC GGC CGG AAG TTG AGA TGG AAG 

GTT GCC TCC CAT CTG CAG AGC TTC 
CAA CGG AGG GTA GAC GTC TCG AAG 

AGG CAC CTT GCC CAG CCC 
TCC GTG GAA CGG GTC GGG 



CTG CCC CAG AGC TTC CTG CTC AAG 
GAC GGG GTC TCG AAG GAC GAG TTC 

CAG GGC GAT GGC GCA GCG CTC CAG 
GTC CCG CTA CCG CGT CGC GAG GTC 

CTG TGC CAC CCC GAG GAG CTG GTG 
GAC ACG GTG GGG CTC CTC GAC CAC 

CCC TGG GCT CCC CTG AGC AGC TGC 
GGG ACC CGA GGG GAC TCG TCG ACG 

GGC TGC TTG AGC CAA CTC CAT AGC 
CCG ACG AAC TCG GTT GAG GTA TCG 

CTG CAG GCC CTG GAA GGG ATC TCC 
GAC GTC CGG GAC CTT CCC TAG AGG 

ACA CTG CAG CTG GAC GTC GCC GAC 
TGT GAC GTC GAC CTG CAG CGG CTG 

ATG GAA GAA CTG GGA ATG GCC CCT 
TAC CTT CTT GAC CCT TAC CGG GGA 

ATG CCG GCC TTC GCC TCT GCT TTC 
TAC GGC CGG AAG CGG AGA CGA AAG 

GTT GCC TCC CAT CTG CAG AGC TTC 
CAA CGG AGG GTA GAC GTC TCG AAG 

AGG CAC CTT GCC CAG CCC 
TCC GTG GAA CGG GTC GGG 



CTG CCC CAG AGC TTC CTG CTC AAG 
GAC GGG GTC TCG AAG GAC GAG TTC 
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GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAG AAG CTG TGC CAC CCC GAG GAG CTG' GTG 
CTC TTC GAC ACA CGG TGG ATG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT AAC ACT AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA TTG GAC TCC TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

e) SEQ ID NO:6 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA CGG TGG ATG TTC .GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AAT TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TTA ACG 

ACC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
TGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC * ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 
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CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

f) SEQ ID NO: 7 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 



GCC TTA GAG CAA GTG AGG AAG ATC 
CGG AAT CTC GTT CAC TCC TTC TAG 

GAG AAG CTG TGT GCC ACC TAC AAG 
CTC TTC GAC ACA CGG TGG ATG TTC 

CTG CTC GGA CAC TCT CTG GGC ATC 
GAC GAG CCT GTG ACA GAC CCG TAG 

CCC AGC CAG GCC CTG CAG CTG GCA 
GGG TCG GTC CGG GAC GTC GAC CGT 

GGC CTT TTC CTC TAC CAG GGG CTC 
CCG GAA AAG GAG ATG GTC CCC GAG 

CCC GAG TTG GGT CCC ACC TTG GAC 
GGG CTC AAC CCA GGG TGG AAC CTG 

TTT GCC ACC ACC ATC TGG CAG CAG 
AAA CGG TGG TGG TAG ACC GTC GTC 

GCC CTG CAG CCC ACC CAG GGT GCC 
CGG GAC GTC GGG TGG GTC CCA CGG 

CAG CGC CGG GCA GGA GGG GTC CTG 
GTC GCG GCC CGT CCT CCC CAG GAC 

CTG GAG GTG TCG TAC CGC GTC TTA 
GAC CTC CAC AGC ATG GCG CAG AAT 



CAG GGC GAT GGC GCA GCG CTC CAG 
GTC CCG CTA CCG CGT CGC GAG GTC 

CTG TGC CAC CCC GAG GAG CTG GTG 
GAC ACG GTG GGG CTC CTC GAC CAC 

GTT AAC GCT ACC CTG AGC AGC TGC 
CAA TTG CGA TGG GAC TCG TCG ACG 

GGC TGC TTG AGC CAA CTC CAT AGC 
CCG ACG AAC TCG- GTT GAG GTA TCG 

CTG CAG GCC CTG GAA GGG ATC TCC 
GAC GTC CGG GAC CTT CCC TAG AGG 

ACA CTG CAG CTG GAC GTC GCC GAC 
TGT GAC GTC GAC CTG CAG CGG CTG 

ATG GAA GAA CTG GGA ATG GCC CCT 
TAC CTT CTT GAC CCT TAC CGG GGA 

ATG CCG GCC TTC GCC TCT GCT TTC 
TAC GGC CGG AAG CGG AGA CGA AAG 

GTT GCC TCC CAT CTG CAG AGC TTC 
CAA CGG AGG GTA GAC GTC TCG AAG 

AGG CAC CTT GCC CAG CCC 
TCC GTG GAA CGG GTC GGG 



g) SEQ ID NO: 8 

ACC CCC CTG GGC CCT GCC AGC TCC 
TGG GGG GAC CCG GGA CGG TCG AGG 

GCC TTA GAG CAA GTG AGG AAG ATC 
CGG AAT CTC GTT CAC TCC TTC TAG 

GAG AAG CTG TGT GCC ACC TAC AAG 
CTC TTC GAC ACA CGG TGG ATG TTC 

CTG CTC GGA CAC TCT CTG GGC ATC 
GAC GAG CCT GTG ACA GAC CCG TAG 

CCC AGC AAC GCC ACC CAG CTG GCA 
GGG TCG TTG CGG TGG GTC GAC CGT 



CTG CCC CAG AGC TTC CTG CTC AAG 
GAC GGG GTC TCG AAG GAC GAG TTC 

CAG GGC GAT GGC GCA GCG CTC CAG 
GTC CCG CTA CCG CGT CGC GAG GTC 

CTG TGC CAC CCC GAG GAG CTG GTG 
GAC ACG GTG GGG CTC CTC GAC CAC 

CCC TGG GCT CCC CTG AGC AGC TGC 
GGG ACC CGA GGG GAC TCG TCG ACG 

GGC TGC TTG AGC CAA CTC CAT AGC 
CCG ACG AAC TCG GTT GAG GTA TCG 



GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
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CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

h) SEQ ID NO: 9 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA CGG TGG ATG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG AAC GGG ACC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC TTG CCC TGG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA . CGG GTC GGG 

i) SEQ ID NO: 10 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 
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GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA TTG' TGG- TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 

CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC AAC CAG ACC GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TTG GTC TGG CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

j) SEQ ID NO: 11 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA TTG TGG TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TCG ACG 



CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC AAC TCT ACC TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG TTG AGA TGG AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 
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CTG GAG GTG TCG TAC CGC 
GAC CTC CAC AGC ATG GCG 

k) SEQ ID NO: 12 

ACC CCC CTG GGC CCT GCC 

TGG GGG GAC CCG GGA CGG 

GCC TTA GAG CAA GTG AGG 
CGG AAT CTC GTT CAC TCC 

GAG AAG CTG TGT AAC ACC 
CTC TTC GAC ACA TTG TGG 

CTG CTC GGA CAC TCT CTG 
GAC GAG CCT GTG ACA GAC 

CCC AGC CAG GCC CTG CAG 
GGG TCG GTC CGG GAC GTC 

GGC CTT TTC CTC TAC CAG 
CCG GAA AAG GAG ATG GTC 

CCC GAG TTG GGT CCC ACC 
GGG CTC AAC CCA GGG TGG 

TTT GCC ACC ACC ATC TGG 
AAA CGG TGG TGG TAG ACC 

GCC CTG CAG CCC ACC CAG 
CGG GAC GTC GGG TGG GTC 

CAG CGC CGG GCA GGA GGG 
GTC GCG GCC CGT CCT CCC 

CTG GAG GTG TCG TAC CGC 
GAC CTC CAC AGC ATG GCG 

1) SEQ ID NO: 13 

ACC CCC CTG GGC CCT GCC 

TGG GGG GAC CCG GGA CGG 



GCC TTA GAG CAA GTG AGG 
CGG AAT CTC GTT CAC TCC 

GAG AAG CTG TGT AAC ACC 
CTC TTC GAC ACA TTG TGG 

CTG CTC GGA CAC TCT CTG 
GAC GAG CCT GTG ACA GAC 

CCC AGC AAC GCC ACC CAG 
GGG TCG TTG CGG TGG GTC 

GGC CTT TTC CTC TAC CAG 
CCG GAA AAG GAG ATG GTC 
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GTC TTA AGG CAC CTT 
CAG AAT TCC GTG GAA 



AGC TCC CTG CCC CAG 
TCG AGG GAC GGG GTC 

AAG ATC CAG GGC GAT 
TTC TAG GTC CCG CTA 

ACC AAG CTG TGC CAC 
TGG TTC GAC ACG GTG 

GGC ATC GTT AAC GCT 
CCG TAG CAA TTG CGA 

CTG GCA GGC TGC TTG 
GAC CGT CCG ACG AAC 

GGG CTC CTG CAG GCC 
CCC GAG GAC GTC CGG 

TTG GAC ACA CTG CAG 
AAC CTG TGT GAC GTC 

CAG CAG ATG GAA GAA 
GTC GTC TAC CTT CTT 

GGT GCC ATG CCG GCC 
CCA CGG TAC GGC CGG 

GTC CTG GTT GCC TCC 
CAG GAC CAA CGG AGG 

GTC TTA AGG CAC CTT 
CAG AAT TCC GTG GAA 



AGC TCC CTG CCC CAG 
TCG AGG GAC GGG GTC 



AAG ATC CAG GGC GAT 
TTC TAG GTC CCG CTA 

ACC AAG CTG TGC CAC 
TGG TTC GAC ACG GTG 

GGC ATC CCC TGG GCT 
CCG TAG GGG ACC CGA 

CTG GCA GGC TGC TTG 
GAC CGT CCG ACG AAC 

GGG CTC CTG CAG GCC 
CCC GAG GAC GTC CGG 
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GCC CAG CCC 
CGG GTC GGG 



AGC TTC CTG CTC AAG 
TCG AAG GAC GAG TTC 

GGC GCA GCG CTC CAG 
CCG CGT CGC GAG GTC 

CCC GAG GAG CTG GTG 
GGG CTC CTC GAC CAC 

ACC CTG AGC AGC TGC 
TGG GAC TCG TCG ACG 

AGC CAA CTC CAT AGC 
TCG GTT GAG GTA TCG 

CTG GAA GGG ATC TCC 
GAC CTT CCC TAG AGG 

CTG GAC GTC GCC GAC 
GAC CTG CAG CGG CTG 

CTG GGA ATG GCC CCT 
GAC CCT TAC CGG GGA 

TTC GCC TCT GCT TTC 
AAG CGG AGA CGA AAG 

CAT CTG CAG AGC TTC 
GTA GAC GTC TCG AAG 

GCC CAG CCC 
CGG GTC GGG 



AGC TTC CTG CTC AAG 
TCG AAG GAC GAG TTC 



GGC GCA GCG CTC CAG 
CCG CGT CGC GAG GTC 

CCC GAG GAG CTG GTG 
GGG CTC CTC GAC CAC 

CCC CTG AGC AGC TGC 
GGG GAC TCG TCG ACG 

AGC CAA CTC CAT AGC 
TCG GTT GAG GTA TCG 

CTG GAA GGG ATC TCC 
GAC CTT CCC TAG AGG 
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CCC GAG TTG GGT CCC ACC TTG GAC 
GGG CTC AAC CCA GGG TGG AAC CTG 

TTT GCC ACC ACC ATC TGG CAG CAG 
AAA CGG TGG TGG TAG ACC GTC GTC 

GCC CTG CAG CCC ACC CAG GGT GCC 
CGG GAC GTC GGG TGG GTC CCA CGG 

CAG CGC CGG GCA GGA GGG GTC CTG 
GTC GCG GCC CGT CCT CCC CAG GAC 

CTG GAG GTG TCG TAC CGC GTC TTA 
GAC CTC CAC AGC ATG GCG CAG AAT 

m) SEQ ID NO: 14 

ACC CCC CTG GGC CCT GCC AGC TCC 
TGG GGG GAC CCG GGA CGG TCG AGG 

GCC TTA GAG CAA GTG AGG AAG ATC 
CGG AAT CTC GTT CAC TCC TTC TAG 

GAG AAG CTG TGT AAC ACC ACC AAG 
CTC TTC GAC ACA TTG TGG TGG TTC 

CTG CTC GGA CAC TCT CTG GGC ATC 
GAC GAG CCT GTG ACA GAC CCG TAG 

CCC AGC CAG GCC CTG CAG CTG GCA 
GGG TCG GTC CGG GAC GTC GAC CGT 

GGC CTT TTC CTC TAC CAG GGG CTC 
CCG GAA AAG GAG ATG GTC CCC GAG 

AAC GGT ACC GGT CCC ACC TTG GAC 
TTG CCA TGG CCA GGG TGG AAC CTG 

TTT GCC ACC ACC ATC TGG CAG CAG 
AAA CGG TGG TGG TAG ACC GTC GTC 

GCC CTG CAG CCC ACC CAG GGT GCC 
CGG GAC GTC GGG TGG GTC CCA CGG 

CAG CGC CGG GCA GGA GGG GTC CTG 
GTC GCG GCC CGT CCT CCC CAG GAC 

CTG GAG GTG TCG TAC CGC GTC TTA 
GAC CTC CAC AGC ATG GCG CAG AAT 

n) SEQ ID NO: 15 

ACC CCC CTG GGC CCT GCC AGC TCC 
TGG GGG GAC CCG GGA CGG TCG AGG 

GCC TTA GAG CAA GTG AGG AAG ATC 
CGG AAT CTC GTT CAC TCC TTC TAG 

GAG AAG CTG TGT AAC ACC ACC AAG 
CTC TTC GAC ACA TTG TGG TGG TTC 
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ACA CTG CAG CTG GAC GTC GCC GAC 
TGT GAC GTC GAC CTG CAG CGG CTG 

ATG GAA GAA CTG GGA ATG GCC CCT 
TAC CTT CTT GAC CCT TAC CGG GGA 

ATG CCG GCC TTC GCC TCT GCT TTC 
TAC GGC CGG AAG CGG AGA CGA AAG 

GTT GCC TCC CAT CTG CAG AGC TTC 
CAA CGG AGG GTA GAC GTC TCG AAG 

AGG CAC CTT GCC CAG CCC 
TCC GTG GAA CGG GTC GGG 



CTG CCC CAG AGC TTC CTG CTC AAG 
GAC GGG GTC TCG AAG GAC GAG TTC 

CAG GGC GAT GGC GCA GCG CTC CAG 
GTC CCG CTA CCG CGT CGC GAG GTC 

CTG TGC CAC CCC GAG GAG CTG GTG 
GAC ACG GTG GGG CTC CTC GAC CAC 

CCC -TGG GCT CCC CTG AGC AGC TGC 
GGG ACC CGA GGG GAC TCG TCG ACG 

GGC TGC TTG AGC CAA CTC CAT AGC 
CCG ACG AAC TCG GTT GAG GTA TCG 

CTG CAG GCC CTG GAA GGG ATC TCC 
GAC GTC CGG GAC CTT CCC TAG AGG 

ACA CTG CAG CTG GAC GTC GCC GAC 
TGT GAC GTC GAC CTG CAG CGG CTG 

ATG GAA GAA CTG GGA ATG GCC CCT 
TAC CTT CTT GAC CCT TAC CGG GGA 

ATG CCG GCC TTC GCC TCT GCT TTC 
TAC GGC CGG AAG CGG AGA CGA AAG 

GTT GCC TCC CAT CTG CAG AGC TTC 
CAA CGG AGG GTA GAC GTC TCG AAG 

AGG CAC CTT GCC CAG CCC 
TCC GTG GAA CGG GTC GGG 



CTG CCC CAG AGC TTC CTG CTC AAG 
GAC GGG GTC TCG AAG GAC GAG TTC 

CAG GGC GAT GGC GCA GCG CTC CAG 
GTC CCG CTA CCG CGT CGC GAG GTC 

CTG TGC CAC CCC GAG GAG CTG GTG 
GAC ACG GTG GGG CTC CTC GAC CAC 
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CTG CTC GGA CAC TCT CTG GGC ATC GTT AAC GCT ACC CTG AGC AGC TGC 
GAC GAG CCT GTG ACA GAC CCG TAG CAA TTG CGA TGG GAC TCG TCG ACG 

CCC AGC AAC GCC ACC CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
GGG TCG TTG CGG TGG GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG' CTC CTG CAG GCC CTG GAA GGG ATC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC CTT CCC TAG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG . GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG 

o) SEQ ID NO: 16 

ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC AAG 
TGG GGG GAC CCG GGA CGG TCG AGG GAC GGG GTC TCG AAG GAC GAG TTC 

GCC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG 
CGG AAT CTC GTT CAC TCC TTC TAG GTC CCG CTA CCG CGT CGC GAG GTC 

GAG AAG CTG TGT AAC ACC ACC AAG CTG TGC CAC CCC GAG GAG CTG GTG 
CTC TTC GAC ACA TTG TGG TGG TTC GAC ACG GTG GGG CTC CTC GAC CAC 

CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AAT TGC 
GAC GAG CCT GTG ACA GAC CCG TAG GGG ACC CGA GGG GAC TCG TTA ACG 

ACC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC 
TGG TCG GTC CGG GAC GTC GAC CGT CCG ACG AAC TCG GTT GAG GTA TCG 

GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG AAC GGG ACC TCC 
CCG GAA AAG GAG ATG GTC CCC GAG GAC GTC CGG GAC TTG CCC TGG AGG 

CCC GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC 
GGG CTC AAC CCA GGG TGG AAC CTG TGT GAC GTC GAC CTG CAG CGG CTG 

TTT GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT 
AAA CGG TGG TGG TAG ACC GTC GTC TAC CTT CTT GAC CCT TAC CGG GGA 

GCC CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC 
CGG GAC GTC GGG TGG GTC CCA CGG TAC GGC CGG AAG CGG AGA CGA AAG 

CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC 
GTC GCG GCC CGT CCT CCC CAG GAC CAA CGG AGG GTA GAC GTC TCG AAG 

CTG GAG GTG TCG TAC CGC GTC TTA AGG CAC CTT GCC CAG CCC 
GAC CTC CAC AGC ATG GCG CAG AAT TCC GTG GAA CGG GTC GGG , 
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to 

fused to a DNA encoding a protein selected from the group 
consisting of: 

a) human albumin, 

b) human albumin analog; and 

c) fragments of human albumin. 

14. The heterologous fusion protein of any one of Claims 1 
through 11 wherein the polypeptide is human albumin . 

15. The heterologous fusion protein of Claim 12, wherein 
the second polypeptide has the sequence of SEQ ID NO: 35. 

16. The heterologous fusion protein of any one of Claims 1 
through 11 wherein the second polypeptide is an N-terminal 
fragment of albumin. 

17. A method for increasing neutrophil levels in a 
mammal comprising the administration of a therapeutically 
effective amount of the heterologous fusion protein of any 
one of Claims 1 through 11, 14 and 15. 

18. The use of the heterologous fusion protein as claimed 
in any one of Claims 1 through 11, 14 and 15 for the 
manufacture of a medicament for the treatment of patients 
with insufficient circulating neutrophil levels. 

19. Use of a heterologous fusion protein of any one of 
Claims 1 through 11, 14, and 15 as a medicament. 

20. Use of a heterologous fusion protein of any one of 
Claims 1 through 11, 14, and 15 for the treatment of 
patients with insufficient circulating. neutrophil levels. 

21. A pharmaceutical formulation adapted for the treatment 
of patients with insufficient neutrophil levels comprising 
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heterologous fusion protein of any one of Claims - 1 • through 
11, 14, and 15. 

22. A heterologous fusion protein comprising a 
hyperglycosylated G-CSF analog fused to a polypeptide 
selected from the group consisting of 

a) the Fc portion of an immunoglobulin; 

b) an analog of the Fc portion of an immunoglobulin; 
and 

c) fragments of the Fc portion of an immunoglobulin. 

23. The heterologous fusion protein of Claim 22, wherein 
the hyperglycosylated G-CSF analog is fused to the 
polypeptide via a peptide linker. 

24. The heterologous fusion protein of the Claim 23 wherein 

the peptide linker is selected from the group consisting of: 

a) a glycine rich peptide; 

b) a peptide having the sequence [Gly-Gly-Gly-Gly-Ser] „ 
where n is 1, 2, 3, 4, or 5; and 

c) a peptide having the sequence [Gly-Gly-Gly-Gly-Ser] 3 . 

23. The heterologous fusion protein of Claims 22, 23 or 24, 
wherein the hyperglycosylated G-CSF analog comprises the 



amino < 


acid sequence 


Of 


the 


formula I 


: [SEQ 


ID 


NO: 


1] 


l 








5 










10 










15 




Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 








20 










25 










30 






Xaa 


Leu 


Glu 


Gin 


Val 


Arg 


Lys 


lie 


Gin 


Gly 


Asp 


Gly 


Ala 


Ala 


Leu 


Gin 






35 










40 










45 








Glu 


Lys 


Leu 


Cys 


Xaa 


Xaa 


Xaa 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 




50 










55 










60 










Leu 


Leu 


Gly 


His 


Ser 


Leu 


Gly 


lie 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


65 










70 










75 










80 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 










85 










90 










95 




Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Xaa 


Xaa 


Xaa 


Ser 








100 










105 










110 






Xaa 


Glu 


Leu 


Gly 


Pro 


Thr 

* 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 






115 










120 










125 








Phe 


Ala 


Thr 


Thr 


lie 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


Met 


Ala 


Pro 




130 










135 










140 




■ 






Ala 


Leu 


Gin 


Pro 


Xaa 


Xaa 


Xaa 


Ala 


Met 


Pro 


Ala 


Phe 


Xaa 


Xaa 


Xaa 


Phe 


145 










150 










155 










160 
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Gln Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Aln Ser Phe 

165 170 
Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro (I) 



wherein: 

Xaa at position 

Xaa at position 

Xaa at position 

Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Pro ; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Xaa at position 
Pro; 

Xaa at position 
Xaa at position 
Xaa at position 
Pro ; and 
Xaa at position 



17 is Cys, Ala, Leu, Ser, or Glu; 
37 is Ala or Asn; 

3 8 is Thr, or any other amino acid except 

39 is Tyr, Thr, or Ser; 
57 is Pro or Val; 
5 8 is Trp or Asn; 

59 is Ala or any other amino acid except 

60 is Pro, Thr, Asn, or Ser, 

61 is Leu, or any other amino acid except 

62 is Ser or Thr; 

63 is Ser or Asn; 

64 is Cys or any other amino acid except 

65 is Pro, Ser, or Thr; 

66 is Ser or Thr; 

67 is Gin or Asn; 

68 is Ala or any other amino acid except 

69 is Leu, Thr, or Ser 

93 is Glu or Asn 

94 is Gly or any other amino acid except 

95 is lie, Asn, Ser, or Thr; 
97 is Pro, Ser, Thr, or Asn; 

133 is Thr or Asn; 

134 is Gin or any other amino acid except 

135 is Gly, Ser, or Thr 

141 is Ala or Asn; 

142 is Ser or any other amino acid except 

143 is Ala, Ser, or Thr; 



and wherein : 

Xaa at positions 37, 38, and 39 constitute region 1 

Xaa at positions 58, 59, and 60 constitute region 2 

Xaa at positions 59, 60, and 61 constitute region 3 

Xaa at positions 60, 61, and 62 constitute region 4 
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Aaa 




• j_ * 

pusi u ions 


£ 1 
Ol ( 






63 


constitute 


region 


=> ; 


Xaa 


at 


positions 


62, 


63, 


and 


64 


constitute 


region 


6; 


Xaa 


at 


positions 


63, 


64, 


and 


65 


constitute 


region 


7; 


Xaa 


at 


positions 


64, 


65, 


and 


66 


constitute 


region 


8; 


Xaa 


at 


positions 


67, 


68, 


and 


69 


constitute 


» 

region 


9; 


Xaa 


at 


positions 


93, 


94, 


and 


95 


constitute 


region 


10; 


Xaa 


at 


positions 


94, 


95, 


and 


Ser at position 96 





constitute region 11; 
Xaa at positions 95, and 97, and Ser at position 96 



constitute region 12; 
Xaa at positions 133, 134, and 135 constitute 
region 13; 

Xaa at positions 141, 142, and 143 constitute 
region 14 ; 

and provided that at least one of regions 1 through 14 
comprises the sequence Asn Xaal Xaa2 wherein Xaal is any 
amino acid except Pro and Xaa2 is Ser or Thr. 

26. The heterologous fusion protein of Claim 25 wherein any 
two regions of regions 1 through 14 comprise the sequence ' 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 

27. The heterologous fusion protein of Claim 25 wherein any 
three regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 

28. The heterologous fusion protein of Claim 25 wherein any 
four regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 
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29. The heterologous fusion protein of Claim 25 wherein the 
hyperglycosylated G-CSF analog is selected from the group 
consisting of : 



a) 


G- 


CSF[A37N,Y39T] 


b) 


G- 


CSF [P57V,W58N, P60T] 


c) 


G- 


CSF[P60N,S62T] 


d) 


G- 


CSF[S63N,P65T] 


e) 


G- 


CSF[Q67N,L69T] 


f ) 


G- 


CSF[E93N, I95T] 


g) 


G- 


•CSF[T133N,G135T] 


h) 


G- 


•CSF [A141N, A143T] 


l) 


G- 


•CSF [A37N, Y3 9T, P57V, W58N, P6 0T] 




G- 


-CSF[A37N,Y39T,P60N, S62T] 


k) 


G- 


■CSF [A37N, Y39T, S63N, P65T] 


1) 


G- 


■CSF [A37N, Y3 9T, Q6 7N, L69T] 


m) 


G- 


-CSF [A37N,Y39T,E93N, I95T] 


n) 


G- 


-CSF [A37N,Y39T, T133N, G135T] 


o) 


G- 


-CSF[A37N,Y3 9T,A141N, A143T] 


P) 


G- 


-CSF[A37N,Y3 9T, P57V, W58N, P60T, S63N, P6 5T] 


q) 


G- 


-CSF [A37N, Y3 9T, P57V, W58N, P60T,Q67N, L69T] 


r) 


G- 


-CSF [A37N, Y3 9T, S63N, P65T, E93N, I95T] . 



30. The heterologous fusion protein of Claim 29 wherein the 
hyperglycosylated G-CSF analog is G- 
CSF[A37N,Y39T,P57V,W58N / P60T, Q67N, L69T] . 

31. The heterologous fusion protein of Claim 29 wherein the 
hyperglycosylated G-CSF analog is G- 

CSF[A37N,Y39T,S63N,P65T,E93N, I95T] . 

32. The heterologous fusion "protein of any one of Claims 22 
through 31 wherein the second polypeptide is the Fc portion 
of an Ig selected from the group consisting of: IgGl, IgG2, 
IgG3, IgG4, IgE, IgA, IgD, or IgM. 
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33. The heterologous fusion protein of any one of Claims 22 
through 32 wherein the second polypeptide is the Fc portion 
of an Ig selected from the group consisting of: IgGl, IgG2, 
IgG3 , and IgG4 . 



34. The heterologous fusion protein of Claim 33 wherein the 
second polypeptide is the Fc portion of an IgGl 
immunoglobulin . 



35. The heterologous fusion protein of Claim 33 wherein the 
second polypeptide is the Fc portion of an IgG4 
immunoglobulin. 



36. The heterologous fusion protein of Claims 22 through 35 
wherein the IgG is human. 



37. The heterologous fusion protein of any one of Claims 22 
through 36 wherein the Fc portion comprises the hinge, CH2 , 
and CH3 domains. 



38. The heterologous fusion protein of Claim 34 wherein the 
polypeptide has the sequence of SEQ ID NO: 33. 



39. The heterologous fusion protein of Claim 34, wherein 
the polypeptide has the following nucleic acid sequence: 

tccaccaagggcccatcggtcttcccgctagcgccctgctccaggagcacctccgagagc 
acagccgccctgggctgcctgg 

tcaaggactacttccccgaaccggtgacggtgtcgtggaactcaggcgccctgaccagcg 
gcgtgcacaccttcccggctgtc 

ctacagtcctcaggactctactccctcagcagcgtggtgaccgtgccctccagcagcttg 
ggcacgaagacctacacctgcaac 

gtagatcacaagcccagcaacaccaaggtggacaagagagttgagtccaaatatggtccc 
ccatgcccaccctgcccagca 

cctgagttcctggggggaccatcagtcttcctgttccccccaaaacccaaggacactctc 
atgatctcccggacccctgaggtcac 

gtgcgtggtggtggacgtgagccaggaagaccccgaggtccagttcaactggtacgtgga 
tggcgtggaggtgcataatgcca 

agacaaagccgcgggaggagcagttcaacagcacgtaccgtgtggtcagcgtcctcaccg 
tcctgcaccaggactggctgaa 

cggcaaggagtacaagtgcaaggtctccaacaaaggcctcccgtcctccatcgagaaaac 
catctccaaagccaaagggca 
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gccccgagagccacaggtgtacaccctgcccccatcccaggaggagatgaccaagaacca 
ggtcagcctgacctgcctggtc 

aaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaac 
aactacaagaccacgcctccc 

gtgctggactccgacggctccttcttcctctacagcaggctaaccgtggacaagagcagg 
tggcaggaggggaatgtcttctcatgc 

tccgtgatgcatgaggctctgcacaaccactacacacagaagagcctctccctgtctctg 
ggtaaatga . 

40. A polynucleotide encoding a heterologous fusion protein 
of any one of Claims 1 through 389. 

41. A vector comprising the polynucleotide of Claim 40. 

42. A host cell comprising the vector of Claim 41. 

43. A host cell expressing at least one heterologous fusion 
protein of any one of Claims 1 through 39. 

44. The host cell of Claim 43 wherein said host cell is a 
CHO cell. 

45. A process for producing a heterologous fusion protein 
comprising the steps of transcribing and translating a 
polynucleotide of Claim 40 under conditions wherein the 
heterologous fusion protein is expressed in detectable 
amount s . 

46. A method for increasing neutrophil levels in a 
mammal comprising the administration of a therapeutically 
effective amount of the heterologous fusion protein of any 
one of Claims 27 through 36. 

47. The use of the heterologous fusion protein as claimed 
in any one of Claims 22 through 29 for the manufacture of a 
medicament for the treatment of patients with insufficient 
circulating neutrophil levels. 



WO 03/076567 



PCT/US03/03120 



-77- 

48. Use of a heterologous fusion protein of any one of 
Claims 22 through 29 as a medicament. 

49. Use of a heterologous fusion protein of any one of 
Claims 22 through 29 for the treatment of patients with 
insufficient circulating neutrophil levels. 

50. A pharmaceutical formulation adapted for the treatment 
of patients with insufficient neutrophil levels comprising a 
heterologous fusion protein of any one of Claims 22 through 
29. 

51. A heterologous fusion protein as hereinbefore described 
with reference to any one of the Examples. 
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SEQUENCE LISTING 

<110> Beals, John 

Kuchibhotla, Uma 

<120> HETEROLOGOUS G-CSF FUSION PROTEINS 

<130> P-15648 

<160> 66 

<170> Patentln version 3.1 

<210> 1 

<211> 174 

<212> PRT 

<213> Artificial Sequence 



«=220> 

<223> synthetic construct 
<220> 

<221> MIS COFEATURE 

<222> U7) . . (17) 

<223> Xaa at position 17 is Cys, Ala, Leu, Ser, or Glu; 



<220> 

<221> MISC_FEATURE 

<222> (37) . . (37) 

<223> Xaa at position 37 is Ala or Asn; 
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<220> 

<221> MIS C_FEATURE 

<222> (38).. (38) 

<223> Xaa at position 38 is Thr, or any other amino acid exept Pro; 
<220> 

<2-21> MIS COFEATURE 

<222> (39) . . (39) 

<223> Xaa at position 39 is Tyr, Thr, or Ser; 
<220> 

<221> MIS C_FEATURE 

<222> (57) . . (57) 

<223> Xaa at position 57 is Pro or Val; 
<220> 

<221> MIS C_FEATURE 

<222> (58) . . (58) 

<223> Xaa at position SB is Trp or Asn;* 

<220> 

<221> MISC_FEATURE 

<222> (59) . . (59) 

<223> Xaa at position 59 is Ala or any other amino acid except Pro; 
<220> 

<221> MIS COFEATURE 

<222> (60).. (60) 
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<223> Xaa at position 60 is Pro, Thr, Asn, or Ser; 



<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (61).. (61) 

<223> Xaa at position 61 is Leu, or any other amino acid except Pro; 



<220> 

<221> MI S COFEATURE 

<222> (62).. (62) 

<223> Xaa at position 62 is Ser or Thr; 



<220> 

<221> MI SC_FEATURE 

<222> (63) . . (63) 

<223> Xaa at position 63 Ser or Asn; 



220> 



<221> MISC FEATURE 



<222> (64).. (64) 



<223> Xaa at position 64 is Cys or any other amino acid except Pro; 



<220> 

<221> MIS COFEATURE 

<222> (65) . . (65) 

<223> Xaa at position 65 is Pro, Ser, or Thr; 
<220> 

<221> MISC FEATURE 
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<222> (66).. (66) 

<223> Xaa at position 66 is Ser or Thr; 

<220> 

<221> MIS COFEATURE 

<222> (67) . . (67) 

<223> Xaa at position 67 is Gin or Asn; 
<220> 

<221> MI SC_ FEATURE 

<222> (68) . . (68) 

<223> Xaa at position 68 is Ala or any other amino acid except Pro; 
<220> 

<221> MI S COFEATURE 

<222> (69) . . (69) 

<223> Xaa at position 69 is Leu, Thr, or Ser; 
<220> 

<221> MISC_FEATURE 

<222> (93) . . (93) 

<223> Xaa at position 93 is Glu or Asn; 
<220> 

<221> MIS C_FEATURE 

<222> (94) . . (94) 

<223> Xaa at position 94 is Gly or any other amino acid except Pro; 



<220> 
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<221> MIS COFEATURE 
<222> (95) . . (95) 

<223> Xaa at position 95 is lie, Asn, Ser, or Thr; 



<220> 

< 2 2 1 > MIS C_FEATURE 

<222> (97) . . (97) 

<223> Xaa at position 97 is Pro, Ser, Thr, or Asn; 



<220> 

<221> MIS C_FEATURE 

<222> (133) . . (133) 

<223> Xaa at position 133 is Thr or Asn; 



<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (134) . . (134) 

<223> Xaa at position 134 is Gin or any other amino acid except Pro; 
<220> 

<221> MIS C_FEATURE 

<222> (135) . . (135) 

<223> Xaa at position 135 is Gly, Ser, or Thr; 



<220> 

<221> MISC_FEATURE 

<222> (141) . . (141) 

<223> Xaa at position 141 is Ala or Asn; 
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<220> 



<221> MISC FEATURE 



<222> (142) . . (142) 



<223> Xaa at position 142 is Ser or any other amino acid except Pro; 



<220> 



<221> MISC FEATURE 



<222> (143) . . (143) 



<223> Xaa at position 143 is Ala, Ser, or Thr 



<400> 1 



Thr Pro Leu Gly Pro Ala 
1 5 



Ser Leu Pro Gin Ser Phe Leu Leu Lys 

10 15 



Xaa Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin 

20 25 30 



Glu Lys Leu Cys Xaa Xaa Xaa Lys Leu Cys His Pro Glu Glu Leu Val 

35 40 45 



Leu Leu Gly His Ser Leu Gly He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Xaa Xaa Xaa Xaa Xaa Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 

65 70 75 80 



Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Xaa Xaa Xaa Ser 

85 90 95 



Xaa Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 

100 105 110 



Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

115 120 125 



Ala Leu Gin Pro Xaa Xaa Xaa Ala Met Pro Ala Phe Xaa Xaa Xaa Phe 
130 135 140 
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Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
145 150 155 160 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 

165 170 



<210> 2 



<211> 1044 



<212> DNA 



<213> Artificial Sequence 



<220> 














<223> synthetic construct 










<400> 2 
acccccctgg 


gccctgccag 


ct ccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ct cgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtgccacc 


tacaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cacggtggat 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


aaccagaccg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtt 


ggtctggcgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 
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<210> 3 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 3 

acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtgccacc 


tacaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cacggtggat 


gttcgacacg 


gtggggctcc 


tcqaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcaactct 


accttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agttgagatg 


gaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 



<210> 4 
<211> 1044 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 4 

acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


54 0 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


'tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 



<210> 5 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 
<400> 5 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg €0 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 180 

cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 240 

ctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 300 

tctctgggca tcccctgggc taacactagc agctgcgacg agcctgtgac agacccgtag 360 

gggacccgat tggactcctc gacgcccagc caggccctgc agctggcagg ctgcttgagc 420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 600 

acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 780 

acccagggtg ccatgccggc- cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 1020 



cagaattccg tggaacgggt cggg 



<210> 6 



<211> 1044 



<212> DNA 



<213> Artificial Sequence 



1044 



<220> 

<223> synthetic construct 
<400> 6 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 60 
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ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtgccacc 


tacaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cacggtggat . 


gttcgacacq 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctqqgca 


tcccctqqqc 


tcccctgagc 


aattgcgacg 


agcctgtgac 


agacccgtag 


360 


aaqacccQaQ 


qqqactcgtt 


aacqaccaqc 

3 — / 


caqgccctgc 


agctggcagg 


ctgcttgagc 


420 


csactccsta 


actcgtcqqt 

W W w 3 V V Jj 


CCqqqacqtC 

3 3 — / ^* 3 


qaccqtccga 


cgaactcggt 


tgaggtatcg 


480 


oacctt ttcc 


tctaccaqqq 


qctcctqcaq 


qccctqqaaq 

3 3 3 3 


ggatctcccc 

3 *J 


ggaaaaggag 


540 


ataatccccq 


aqqacqtccq 


qqaccttccc 


taqaqqcccq 


agttgggtcc 


caccttggac 


600 


acactacaac 


tqqacqtcqc 


cqacqqqctc 

J 333 **** 


aacccaqqqt 


ggaacctgtg 


tgacgtcgac 


660 


ctacaQCQQC 


tgtttgccac 


caccatctgq 


caqcaqatqq 

w W 3 33 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 

* 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 



<210> 7 
<211> 1044 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 7 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 60 
ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 120 
cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 180 
cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 240 
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ctcttcgaca 


cacggtggat 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcgttaacgc 


taccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


caattgcgat 


gggactcgtc 

V 4^ m^ W 


gacgcccaqc 


caggccctgc 


agctqgcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcggt 


ccqggacqtc 


qaccqtccqa 


cqaactcqqt 


tqaqqtatcq 


480 


qgccttttcc 


tctaccaqqq 


qctcctqcaq 


qccctqqaaq 


qqat ctcccc 

j ^3 w ^ w w* w w w 


aqaaaaQQact 


540 


atQQtCCCCQ 


aqqacgtccq 


QQacct tccc 


t aaaaaccca 


aottcoatcc 


caccttaaac 


6 0 0 

\J \J \J 


acactocaQC 




caacQOQctc 


aaccrsnnnt 

d G W V- s— ■ y y W 


oaaarrtnto 


w y w y v \w m w 


D V V 




totttaccac 

w ~j www *n V* w* wi w> 


W* W W Q l_ ^ W y y 


w, ay ^. oy c* ^yy 


cidyadL Ly yy 


f* n r+ r* r+ h- 


TO ft 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt- 


cggg 








1044 


<210> 8 






+ 








<211> 1044 












<212> DNA 














<213> Artificial Sequence 










<220> 














<223> synthetic construct 










<400> 8 
acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


A - ^_ 

tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtgccacc 


tacaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cacggtggat 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


aacgccaccc 


agctggcagg 


ctgcttgagc 


420 
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caactccata gcgggtcgtt gcggtgggtc gaccgtccga cgaactcggt tgaggtatcg 4 80 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 600 

acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 1020 

• cagaattccg tggaacgggt cggg 1044 

<210> 9 
<211> 1044 
<212> DNA 

<213> Artificial Sequence _ 
<220> 

<223> synthetic construct 
<400> 9 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 180 

cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 240 

ctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 300 

tctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 360 

gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 480 

ggccttttcc tctaccaggg gctcctgcag gccctgaacg ggacctcccc ggaaaaggag 540 

atggtccccg aggacgtccg ggacttgccc tggaggcccg agttgggtcc caccttggac 600 
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acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 1020 



cagaattccg tggaacgggt cggg 



<210> 10 



<211> 1044 



<212> DNA 



<213> Artificial Sequence 



<220> 



<223> synthetic construct 



<400> 10 



1044 



acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata gcgggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 
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aaccagaccg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtt 


ggtctggcgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 



<210> 11 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 11 



acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 

■* 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcaactct 


accttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agttgagatg 


gaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 
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ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 1020 
cagaattccg tggaacgggt cggg 1044 

<210> 12 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 



<220> 














<223> synthetic construct 










<400> 12 
acccccctgg 


gccctgccag 


* 

ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcgttaacgc 


taccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


caattgcgat 


gggactcgtc 


gacgcccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc -tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 
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<210> 13 
<211> 1044 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 



<400> 13 

acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


• 

gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct. gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


aacgccaccc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcgtt 


gcggtgggtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


■ 

accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 



<210> 14 
<211> 1044 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> synthetic construct 



^ A ft n *s t 21 

acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtc 


gacgcccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggaacg 


gtaccggtcc 


caccttggac 


600 


acactgcagc 


tggacgtcgc 


cgacttgcca 


tggccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccaggctg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcae 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 



<210> 15 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> synthetic construct 



400> 15 



acccccctgg 


gccctgccag 


ctccctgccc 


cagagcttcc 


tgctcaagtg 


gggggacccg 


60 


ggacggtcga 


gggacggggt 


ctcgaaggac 


gagttcgcct 


tagagcaagt 


gaggaagatc 


120 


cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


* 

tcgagaagct 


gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcgttaacgc 


taccctgagc 


agctgcgacg 


agcctgtgac 


agacccgtag 


360 


caattgcgat 


gggactcgtc 


gacgcccagc 


aacgccaccc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gcgggtcgtt 


gcggtgggtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaggg 


gctcctgcag 


gccctggaag 


ggatctcccc 


ggaaaaggag 


540 


atggtccccg 


aggacgtccg 


ggaccttccc 


tagaggcccg 


agttgggtcc 


caccttggac 


600 


acactgcagc 


tagacgtcgc 


cgacgggctc 


aacccagggt 


ggaacctgtg 


tgacgtcgac 


660 


ctgcagcggc 


tgtttgccac 


caccatctgg 


cagcagatgg 


aagaactggg 


-«Bk < ^^p- ^*^m- *bb» *alBh *ariBk 

aatggcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


* 

ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 



<210> 16 



<211> 1044 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 16 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 60 
ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 120 



WO 03/076567 PCTYUS03/03120 

20/48 



cagggcgatg 


gcgcagcgct 


ccagcggaat 


ctcgttcact 


ccttctaggt 


cccgctaccg 


180 


cgtcgcgagg 


tcgagaagct 


gtgtaacacc 


accaagctgt 


gccaccccga 


ggagctggtg 


240 


ctcttcgaca 


cattgtggtg 


gttcgacacg 


gtggggctcc 


tcgaccacct 


gctcggacac 


300 


tctctgggca 


tcccctgggc 


tcccctgagc 


aattgcgacg 


agcctgtgac 


agacccgtag 


360 


gggacccgag 


gggactcgtt 


aacgaccagc 


caggccctgc 


agctggcagg 


ctgcttgagc 


420 


caactccata 


gctggtcggt 


ccgggacgtc 


gaccgtccga 


cgaactcggt 


tgaggtatcg 


480 


ggccttttcc 


tctaccaaqg 


gctcctgcag 


gccctgaacg 


ggacctcccc 


ggaaaaggaq 


540 


atqqtccccg 


aqqacqtccq 


qgacttqccc 


tqqaqqcccq 


aqttqqqtcc 


caccttqqac 


600 

V 


acactgcagc 


tggacgtcgc 


CqacqqqctC 


aacccagggt 


qqaacctqtq 


tqacqtcqac 


660 


ctqcaqcqqc 


tqtttqccac 


caccatctqq 


caacaqatqq 


aaqaactqqq 


aatqqcccct 


720 


aaacggtggt 


ggtagaccgt 


cgtctacctt 


cttgaccctt 


accggggagc 


cctgcagccc 


780 


acccagggtg 


ccatgccggc 


cttcgcctct 


gctttccggg 


acgtcgggtg 


ggtcccacgg 


840 


tacggccgga 


agcggagacg 


aaagcagcgc 


cgggcaggag- 


gggtcctggt 


tgcctcccat 


900 


ctgcagagct 


tcgtcgcggc 


ccgtcctccc 


caggaccaac 


ggagggtaga 


cgtctcgaag 


960 


ctggaggtgt 


cgtaccgcgt 


cttaaggcac 


cttgcccagc 


ccgacctcca 


cagcatggcg 


1020 


cagaattccg 


tggaacgggt 


cggg 








1044 


<210> 17 


* 












<211> 1762 












c212> DNA 














<213> Artificial Sequence 










<220> 








- 






<223> synthetic construct 










<400> 17 
gatgcgcaca 


agagtgaggt 


tgctcatcgg 


tttaaagatt 


tgggagaaga 


aaatttcaaa 


60 


gccttggtgt 


tgattgcctt 


tgctcagtat 


cttcagcagt 


gtccatttga 


agatcatgta 


120 


aaattagtga 


atgaagtaac 


tgaatttgca 


aaaacatgtg 


ttgctgatga 


gtcagctgaa 


180 


aattgtgaca 


aatcacttca 


tacccttttt 


ggagacaaat 


tatgcacagt 


tgcaactctt 


240 


cgtgaaacct 


atggtgaaat 


ggctgactgc 


tgtgcaaaac 


aagaacctga 


gagaaatgaa 


300 
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tgcttcttgc 


aacacaaaga 


tgacaaccca 


aacctccccc 


gattggtgag 


accagaggtt 


360 


gatgtgatgt 


gcactgcttt 


tcatgacaat 


gaagagacat 


ttttgaaaaa 


atacttatat 


420 


gaaattgcca 


gaagacatcc 


ttacttttat 


gccccggaac 


tccttttctt 


tgctaaaagg 


480 


tataaagctg 


cttttacaga 


atgttgccaa 


gctgctgat a 


aagctgcctg 


cctgttgcca 


540 


aagctcgatg 


aacttcggga 


tgaagggaag 


gcttcgtctg 


ccaaacagag 


actcaagtgt 


600 


qccagtctcc 


aaaaatttgg 


agaaagagct 


ttcaaagcat 


gggcagtagc 


tcgcctgagc 


660 


cagagatttc 


ccaaagctga 


gtttgcagaa 


gtttccaagt 


tagtgacaga 


tcttaccaaa 


720 * 


gtccacacgg 


aatgctgcca 


tggagatctg 


cttgaatgtg 


ctgatgacag 


ggcggacctt 


780 


qccaagtata 


tctgtgaaaa 


* 

tcaagattcg 


atctccagta 


aactgaagga 


atgctgtgaa 


840 


aaacctctgt 


tggaaaaatc 


ccactgcatt 


gccgaagtgg 


aaaatgatga 


gatgcctgct 


900 


gacttgcctt 


cattagctgc 


tgattttgtt 


gaaagtaagg 


atgtttgcaa 


aaactatgct 


960 


gaggcaaagg 


atgtcttcct 


gggcatgttt 


ttgtatgaat 


atgcaagaag 


gcatcctgat 


1020 


tactctgtcg 


tgctgctgct 


gagacttgcc 


aagacatatg 


aaaccactct 


agagaagtgc 


1080 


tgtgccgctg 


cagatcctca 


tgaatgctat 


gccaaagtgt 


tcgatgaatt 


taaacctctt 


1140 


gtggaagagc 


ctcagaattt 


aatcaaacaa 


aattgtgagc 


tttttgagca 


gcttggagag 


1200 


tacaaattcc 


agaatgcgct 


attagttcgt 


tacaccaaga 


aagtacccca 


agtgtcaact 


1260 


ccaactcttg 


tagaggtctc 


aagaaaccta 


ggaaaagtgg 


gcagcaaatg 


ttgtaaacat 


1320 


cctgaagcaa 


aaagaatgcc 


ctgtgcagaa 


gactatctat 


ccgtggtcct 


gaaccagtta 


1380 


tgtgtgttgc 


atgagaaaac 


gccagtaagt 


gacagagtca 


ccaaatgctg 


cacagaatcc 


1440 


u t y y uycaawa 




V k W k b W 0 w 1— 




ataaaacata 


cqttcccaaa 


1500 


gagtttaatg 


ctgaaacatt 


caccttccat 


gcagatatat 


gcacactttc 


tgagaaggag 


1560 


agacaaatca 


agaaacaaac 


tgcacttgtt 


gagctcgtga 


aacacaagcc 


caaggcaaca 


1620 


aaagagcaac 


tgaaagctgt 


tatggatgat 


ttcgcagctt 


ttgtagagaa 


gtgctgcaag 


1680 


gctgacgata 


aggagacctg 


ctttgccgag 


gagggtaaaa 


aacttgttgc 


tgcaagtcaa 


1740 


gctgccttag 


gcttataatg 


ac 
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<210> 18 
<211> 232 
<212> PRT 
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<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 18 

Ala Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro 
15 10 15 



Ala Pro Glu Lys Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 

20 25 30 



Lys Asp Thr Lys Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val 

35 40 45 



Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val 
50 55 60 



Asp Gly Val Glu Val His- Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
65 70 75 80 



Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin 

85 90 95 



Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala 

100 105 110 



Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly Gin Pro 

115 120 125 



Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr 
130- 135 140 



Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser 

145 150 155 160 



Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 

165 170 175 



Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 

' 180 185 190 
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Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe 

195 200 205 



Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 

210 215 220 



Ser Leu Ser Leu Ser Pro Gly Lys 
225 230 



<210> 19 



<211> 229 



<212> PRT 



<213> Artificial Sequence 



<220> 



<223> synthetic contruct 



<400> 19 



Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Glu Phe 
i 5 10 15 



Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr 

20 25 30 



Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val 

35 40 45 



Ser Gin Glu Asp Pro Glu Val Gin Phe Asn Trp Tyr Val Asp Gly Val 
50 55 60 



Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Phe Asn Ser 

65 70 75 80 



Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp Leu 

85 90 95 



Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser 

100 105 HO 



Ser He Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro 

115 120 125 
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Gin Val Tyr Thr Leu Pro Pro Ser Gin Glu Glu Met Thr Lys Asn Gin 

130 135 140 



Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp lie Ala 

145 150 155 160 



Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr 

165 170 175 



Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg Leu 

180 185 190 



Thr Val Asp Lys Ser Arg Trp Gin Glu Gly Asn Val Phe Ser Cys Ser 

195 200 205 



Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser 
210 215 220 



Leu Ser Leu Gly Lys 
225 



<210> 20 



<211> 585 



<212> PRT 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 20 

Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu 
15 10 15 



Glu Asn Phe Lys Ala Leu Val Leu lie Ala Phe Ala Gin Tyr Leu Gin 

20 25 30 



Gin Cys Pro Phe Glu Asp His Val Lys Leu Val Asn Glu Val Thr Glu 

35 40 45 
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Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu Asn Cys Asp Lys 
50 55 60 



Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu 

65 70 75 80 



Arg Glu Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gin Glu Pro 

85 90 95 



Glu Arg Asn Glu Cys Phe Leu Gin His Lys Asp Asp Asn Pro Asn Leu 

100 105 110 



Pro Arg Leu Val Arg Pro Glu Val Asp Val Met Cys Thr Ala Phe His 

115 120 125 



Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr Glu lie Ala Arg 

130 135 140 



Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg 
145 150 155 160 



Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala Ala 

165 ±70 175 



Cys Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser 

180 185 190 



Ser Ala Lys Gin Arg Leu Lys Cys Ala Ser Leu Gin Lys Phe Gly Glu 

195 200 205 



Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser Gin Arg Phe Pro 
210 215 220 



Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys 

225 230 235 240 



Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp 

245 250 255 



Arg Ala Asp Leu Ala Lys Tyr lie Cys Glu Asn Gin Asp Ser He Ser 

260 265 270 



Ser Lys Leu Lys Glu Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His 

275 280 285 
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Cys lie Ala Glu Val Glu Asn Asp Glu Met Pro Ala Asp Leu Pro Ser 
290 295 300 



Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala 
305 310 315 320 



Glu Ala Lys Asp Val Phe Leu Gly Met Phe Leu Tyr Glu Tyr Ala Arg 

325 330 335 



Arg His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr 

340 345 350 



Tyr Glu Thr Thr Leu Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu 

355 360 365 



Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu Val Glu Glu Pro 
370 375 380 



Gin Asn Leu lie Lys Gin Asn Cys Glu Leu Phe Glu Asn Leu Gly Glu 
385 390 395 400 



Tyr Lys Phe Gin Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro 

405 410 415 



Gin Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys 

420 425 430 



Val Gly Ser Lys Cys Cys Lys His Pro Glu Ala Lys Arg Met Pro Cys 

435 440 445 



Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gin Leu Cys Val Leu His 

450 455 460 



Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser 

465 470 475 480 



Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr 

485 490 495 



Tyr Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp 

500 505 510 
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lie Cys Thr Leu Ser Glu Lys Glu Arg Gin lie Lys Lys Gin Thr Ala 

515 520 525 

Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr Lys Glu Gin Leu 
530 535 540 

Lys Ala Val Met Asp Asp Phe Ala Ala Fhe Val Glu Lys Cys Cys Lys 
545 550 555 560 

Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu* Glu Gly Lys Lys Leu Val 

565 570 575 

Ala Ala Ser Gin Ala Ala Leu Gly Leu 

580 585 

<210> 21 
<211> 703 
<212> DNA 

<213> Artificial Seguence 



<220> 

<223> synthetic construct 
<400> 21 



gagcccaaat 


cttgtgacaa 


aactcacaca 


tgcccaccgt 


gcccagcacc 


tgaactcctg 


60 


gggggaccgt 


cagtcttcct 


cttcccccca 


aaacccaagg 


acaccctcat 


gatctcccgg 


120 


acccctgagg 


tcacatgcgt 


ggtggtggac 


gtgagccacg 


aagaccctga 


ggtcaagttc 


180 


aactggtacg 


tggacggcgt 


ggaggtgcat 


aatgccaaga 


caaagccgcg 


ggaggagcag 


240 


tacaacagca 


cgtaccgtgt 


ggtcagcgtc 


ctcaccgtcc 


tgcaccagga 


ctggctgaat 


300 


ggcaaggagt 


acaagtgcaa 


ggtctccaac 


aaagccctcc 


cagcccccat 


cgagaaaacc 


360 


atctccaaag 


ccaaagggca 


gccccgagaa 

* 


ccacaggtgt 


acaccctgcc 


cccatcccgg 


420 


gaggagatga 


ccaagaacca 


ggtcagcctg 


acctgcctgg 


tcaaaggctt 


ctatcccagc 


480 


gacatcgccg 


tggagtggga 


gagcaatggg 


cagccggaga 


acaactacaa 


gaccacgcct 


540 


cccgtgctgg 


actccgacgg 


ctccttcttc 


ctctatagca 


agctcaccgt 


ggacaagagc 


600 


aggtggcagc 


aggggaacgt 


cttctcatgc 


tccgtgatgc 


atgaggctct 


gcacaaccac 


660 


tacacgcaga 


agagcctctc 


cctgtctccg 


ggtaaatgat 


agt 


* 


703 
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PCT/US03/03120 



<210> 22 

<211> 981 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 22 

tccaccaagg gcccatcggt cttcccgcta gcgccctgct ccaggagcac ctccgagagc 60 

acagccgccc tgggctgcct ggtcaaggac tacttccccg aaccggtgac ggtgtcgtgg 120 

aactcaggcg ccctgaccag cggcgtgcac accttcccgg ctgtcctaca gtcctcagga 180 

ctctactccc tcagcagcgt ggtgaccgtg ccctccagca gcttgggcac gaagacctac 240 

acctgcaacg tagatcacaa gcccagcaac accaaggtgg acaagagagt tgagtccaaa 300 

tatggtcccc catgcccacc ctgcccagca cctgagttcc tggggggacc atcagtcttc 360 

ctgttccccc caaaacccaa ggacactctc atgatctccc ggacccctga ggtcacgtgc 420 

gtggtggtgg acgtgagcca ggaagacccc gaggtccagt tcaactggta cgtggatggc 480 

gtggaggtgc ataatgccaa gacaaagccg cgggaggagc agttcaacag cacgtaccgt 540 

gtggtcagcg tcctcaccgt cctgcaccag gactggctga acggcaagga gtacaagtgc 600 

+■ 

a-aggtctcca acaaaggcct cccgtcctcc atcgagaaaa ccatctccaa agccaaaggg 660 

cagccccgag agccacaggt gtacaccctg cccccatccc aggaggagat gaccaagaac 720 

caggtcagcc tgacctgcct ggtcaaaggc ttctacccca gcgacatcgc cgtggagtgg 780 

gagagcaatg ggcagccgga gaacaactac aagaccacgc ctcccgtgct ggactccgac 840 

ggctccttct tcctctacag caggctaacc gtggacaaga gcaggtggca ggaggggaat 900 

gtcttctcat gctccgtgat gcatgaggct ctgcacaacc actacacaca gaagagcctc 960 

tccctgtctc tgggtaaatg a 981 

<210> 23 
<211> 406 
<212> PRT 
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<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 23 

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
15 10 15 



Ala Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 

20 25 30 



Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 

35 40 45 



Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 



Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
65 70 75 80 



Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 

85 90 95 



Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 

100 105 110 



Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

115 120 125 



Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 



Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 

145 150 155 160 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Glu Pro 

165 170 175 



Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu 

180 185 190 



4 
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Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp 

195 200 205 



Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp 
210 215 220 



Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly 

225 230 235 240 



Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn 

245 250 255 



Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp 

260 265 270 



Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro 

275 280 285 



Ala Pro He Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro Arg Glu 
290 295 300 



Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn 
305 310 315 320 



Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp He 

325 330 335 



Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr 

340 345 350 



Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys 

355 360 365 



Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys 
370 375 380 



Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu 
385 390 395 400 



Ser Leu Ser Pro Gly Lys 

405 



<210> 24 



WO 03/076567 



PCT/US03/03120 



31/48 



<211> 403 



<212> PRT 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 

<400> 24 

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 

15 10 15 



Ala Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 

20 25 30 



Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 

35 40 45 



Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 



Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 

65 70 75 80 



Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser 

85 90 95 



Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 

100 105 110 



Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

115 120 125 



Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 



Gin Arg Arg. Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 

145 150 155 160 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Glu Ser 

165 170 175 



WO 03/076567 



Lys Tyr G.ly Pro 

180 



Gly Pro Ser Val 

195 



lie Ser Arg Thr 
210 



Glu Asp Pro Glu 
225 



His Asn Ala Lys 



Arg Val Val Ser 

260 



Lys Glu Tyr Lys 

275 



Glu Lys Thr lie 
290 



Tyr Thr Leu Pro 
305 



Leu Thr Cys Leu 



Trp Glu Ser Asn 

340 



Val Leu Asp Ser 

355 



Asp Lys Ser Arg 
370 



His Glu Ala Leu 
385 



32/48 



Pro Cys Pro Ser Cys 

185 



Phe Leu Phe Pro Pro 

200 



Pro Glu Val Thr Cys 

215 



Val Gin Phe Asn Trp 
230 



Thr Lys Pro Arg Glu 
245 



Val Leu Thr Val Leu 

265 



Cys Lys Val Ser Asn 

280 



Ser Lys Ala Lys Gly 

295 



Pro Ser Gin Glu Glu 
310 



Val Lys Gly Phe Tyr 
325 



Gly Gin Pro Glu Asn 

345 



Asp Gly Ser Phe Phe 

360 



Trp Gin Glu Gly Asn 

375 



His Asn His Tyr Thr 
390 



Pro Ala Pro Glu 



Lys Pro Lys Asp 

205 



Val Val Val Asp 

220 



Tyr Val Asp Gly 
235 



Glu Gin Phe Asn 
250 



His Gin Asp Trp 



Lys Gly Leu Pro 

285 



Gin Pro Arg Glu 

300 



Met Thr Lys Asn 
315 



Pro Ser Asp lie 
330 



Asn Tyr Lys Thr 



Leu Tyr Ser Arg 

365 



Val Phe Ser Cys 

380 



Gin Lys Ser Leu 
395 



PCT/US03/03120 



Phe Leu Gly 
190 



Thr Leu Met 



Val Ser Gin 



Val Glu Val 

240 



Ser Thr Tyr 
255 



Leu Asn Gly 

270 



Ser Ser lie 



Pro Gin Val 



Gin Val Ser 

320 



Ala Val Glu 
335 



Thr Pro Pro 
350 



Leu Thr Val 



Ser Val Met 



Ser Leu Ser 

400 



Leu Gly Lys 



WO 03/076567 



33/48 



PCTYUS03/03120 



<210> 25 



<211> 500 



<212> PRT 



<213> Artificial Sequence 



<220> 



<223> synthetic construct 



<400> 25 



Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
1 5 10 15 



Ala Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 

20 25 30 



Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 

35 40 45 



Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 



Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
65 70 75 80 



Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 

85 90 95 



Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 

100 105 HO 



Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

115 120 125 



Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 



Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 

145 150 155 160 



WO 03/076567 



34/48 



PCT7US03/03120 



Leu Glu Val Ser 



Gly Gly Ser Gly 

180 



Lys Ser Glu Val 

195 



Lys Ala Leu Val 

210 



Phe Glu Asp His 
225 



Thr Cys Val Ala 



Thr Leu Phe Gly 

260 



Tyr Gly Glu Met 

275 



Glu Cys Phe Leu 
290 



Val Arg Pro Glu 
305 



Glu Thr Phe Leu 



Tyr Phe Tyr Ala 

340 



Ala Phe Thr Glu 

355 



Pro Lys Leu Asp 
370 



Gin Arg Leu Lys 
385 



Tyr Arg Val Leu 
165 



Gly Gly Gly Ser 



Ala His Arg Phe 

200 



Leu lie Ala Phe 

215 



Val Lys Leu Val 
230 



Asp Glu Ser Ala 

245 



Asp Lys Leu Cys 



Ala Asp Cys Cys 

280 



Gin His Lys Asp 

295 



Val Asp Val Met 
310 



Lys Lys Tyr Leu 
325 



Pro Glu Leu Leu 



Cys Cys Gin Ala 

360 



Glu Leu Arg Asp 

375 



Cys Ala Ser Leu 
390 



Arg His Leu Ala 

. 170 



Gly Gly Gly Gly 
185 



Lys Asp Leu Gly 



Ala Gin Tyr Leu 

220 



Asn Glu Val Thr 

235 



Glu Asn Cys Asp 

250 



Thr Val Ala Thr 
265 



Ala Lys Gin Glu 



Asp Asn Pro Asn 

300 



Cys Thr Ala Phe 

315 



Tyr Glu lie Ala 
330 



Phe Phe Ala Lys 

345 



Ala Asp Lys Ala 



Glu Gly Lys Ala 

380 



Gin Lys Phe Gly 

395 



Gin Pro Gly Gly 

175 



Ser Asp Ala His 
190 



Glu Glu Asn Phe 

205 



Gin Gin Cys Pro 



Glu Phe Ala Lys 

240 



Lys Ser Leu His 

255 



Leu Arg Glu Thr 
270 



Pro Glu Arg Asn 
285 



Leu Pro Arg Leu 



His Asp Asn Glu 

320 



Arg Arg His Pro 

335 



Arg Tyr Lys Ala 
350 



Ala Cys Leu Leu 
365 



Ser Ser Ala Lys 



Glu Arg Ala Phe 

400 



WO 03/076567 
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PCTAJS03/03120 



Lys Ala Trp Ala Val Ala Arg Leu Ser Gin Arg Phe Pro Lys Ala Glu 

405 410 415 



Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys Val His Thr 

420 425 430 



Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp 

435 440 445 



Leu Ala Lys Tyr lie Cys Glu Asn Gin Asp Ser He Ser Ser Lys Leu 

450 455 460 



Lys Glu Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His Cys He Ala 
465 470 475 480 



Glu Val Glu Asn Asp Glu Met Pro Ala Asp Leu Pro Ser Leu Ala Ala 

485 490 495 



Asp Phe Val Glu 

500 



<210> 26 



<211> 69 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 26 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 
atgaagctg 



<210> 27 



<211> 61 



<212> DNA 



<213> Artificial Sequence 



WO 03/076567 



PCT7US03/03120 



36/48 



<220> 



<223> synthetic construct 



<400> 27 

ggggcaggga gctggctggg cccagtggag tggcttcctg cactgtccag agtgcactgt 



60 



9 



61 



<210> 28 

<211> 59 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 28 

ggacagtgca ggaagccact ccactgggcc cagccagctc cctgccccag agcttcctg 59 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 29 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 60 
caggaagctc tg 72 



<210> 



29 



<211> 



72 



<210> 



30 



<211> 



69 



<212> 



DNA 



<213> Artificial Sequence 



WO 03/076567 PCT/US03/03120 

37/48 



<220> 

<223> synthetic construct 
<400> 30 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 60 



atgaagctg 



69 



<210> 31 



<211> 57 



<212> DNA 



<213> Artificial Sequence 



<220> 



<223> synthetic construct 



<400> 31 

gctctaaggc cttgagcagg aagctctggg gcagggagct cgctgggccc agtggag 57 



<210> 32 



<211> 53 



<212> DNA 



<213> Artificial Sequence 



<220> 



<223> synthetic construct 



<400> 32 

gggcccagcg agctccctgc cccagagctt cctgctcaag gccttagagc aag 53 



<210> 33 



<211> 72 



<212> DNA 



<213> Artificial Sequence 



WO 03/076567 PCT/US03/03120 

38/48 



<220> 

<223> synthetic construct 
<400> 33 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 60 



caggaagctc tg 



72 



<210> 34 



<211> 69 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 

<400> 34 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 60 



atgaagctg 



69 



<210> 35 



<211> 61 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 35 

gtccgagcag cactagttcc tcggggtggc acagcttggt ggtgttacac agcttctcct 60 
g 61 



<210> 36 



<211> 66 



<212> DNA 



<213> Artificial Sequence 



WO 03/076567 PCT/OS03/03120 

39/48 



<220> 



<223> synthetic construct 



<400> 36 

ggcgcsgcgc tccaggagaa gctgtgtaac accaccaagc tgtgccaccc cgaggaacta 60 



gtgctg 



66 



<210> 37 



<211> 72 



<212> DNA 



<213> Artificial Sequence 



<220> 



<223> synthetic construct 



<400> 37 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 60 



caggaagctc tg 



72 



<210> 38 



<211> 69 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 

* 

<400> 38 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 6 0 



atgaagctg 



69 



210> 39 



<211> 61 



<212> DNA 



WO 03/076567 PCT/US03/03120 

40/48 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 

<400> 39 

gcccggcgct ggaaagcgct ggcgaaggcc ggcatggcgg tctggttggg ctgcagggca 60 

g 6i 



<210> 40 



<211> 60 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 40 

ggcccctgcc ctgcagccca accagaccgc catgccggcc ttcgccagcg ctttccagcg 60 



<210> 41 



<211> 72 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 41 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc. 60 



caggaagctc tg 



72 



<210> 42 



<211> 69 



<212> DNA 



WO 03/076567 PCT/US03/03120 

41/48 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 42 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 60 



atgaagctg 



69 



<210> 43 



<211> 68 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 43 

gcccggcgct ggaaggtaga gttgaaggcc ggcatggcac cctgggtggg ctgaagagca 60 



ggggccat 



68 



210> 44 



<211> 74 



<212> DNA 



<213> Artificial Sequence 



<220> 



<223> synthetic construct 



<400> 44 

gggaatggcc cctgctcttc agcccaccca gggtgccatg ccggccttca actctacctt 60 



ccagcgccgg gcag 



74 



210> 45 



211> 72 



WO 03/076567 PCT/US03/03120 

42/48 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 45 

■ 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 60 



caggaagctc tg 



72 



<210> 46 



<211> 20 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 46 

gctagcggcg cgccaccatg 



20 



<210> 47 



<211> 33 



<212> GJJA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 

<400> 47 

gctcagggta gcgttaacga tgcccagaga gtg 33 



<210> 48 



<211> 30 



<212> DNA 



WO 03/076567 



PCT/US03/03120 



43/48 



<213> 



Artificial Sequence 



<220> 



<223> 



synthetic construct 



<400> 48 

gggcatcgtt aacgctaccc tgagcagctg 



30 



<210> 49 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 49 

gactcgagga tcctcattag ggctggg 27 

<210> 50 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 50 

gctagcggcg cgccaccatg gccggacctg ccacccag 38 

<210> 51 

<211> 45 

<212> DNA 

<213> Artificial Sequence 



WO 03/076567 



PCT/US03/03120 



44/48 



<220> 



<223> synthetic construct 



<400> 51 

caagcagccg gccagctggg tggcgttgct ggggcagctg ctcag 



45 



<210> 52 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 52 

gccccagcaa cgccacccag ctggccggct gcttgag 37 

<210> 53 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 53 

gactcgagga tcctcattag ggctgggcaa ggtgccttaa gacgcgg 47 



<210> 



54 



<211> 



20 



<212> DNA 



<213> Artificial Sequence 



<220> 
<223> 



synthetic construct 



WO 03/076567 



45/48 



PCT/US03/03120 



<400> 54 

gctagcggcg cgccaccatg 



<210> 55 



<211> 27 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 55 

ggggcaacta gtcaggttag cccaggg 



<210> 56 



<211> 27 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 

c400> 56 

gctaacctga ctagttgccc cagccag 



<210> 57 



<211> 27 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 57 

gactcgagga tcctcattag ggctggg 



20 



27 



27 



27 



WO 03/076567 



PCT/US03/03120 



46/48 



<210> 58 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<:400> 58 

gctagcggcg cgccaccatg 2 

<210> 59 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 59 

ggtgcaattg ctcaggggag cccag 2 

<210> 60 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 60 

gcaattgcac cagccaggcc ctg 2 



<210> 



61 



<211> 



27 



WO 03/076567 



47/48 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 61 

gactcgagga tcctcattag ggctggg 

<210> 62 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 62 

gctagcggcg cgccaccatg gccggacctg ccacccag 

<210> 63 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 63 

ccggactggt cccgttcagg gcctgcagga gcccctg 

<210> 64 

<211> 35 

<212> DNA 

<213> Artificial Sequence 



WO 03/076567 



PCT/US03/03120 



48/48 



<220> 



<223> synthetic construct 



<400> 64 

gaacgggacc agtccggagt tgggtccc&c cttgg 



35 



<210> 65 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 65 

gactcgagga tcctcattag ggctgggcaa ggtgccttaa gacgcgg 47 

<210> 66 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 66 

gtcgacgcta gcggcgcgcc accatggccg gacctg 



36 



